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PATENT 

Docket No.: 19603/606 (CRT D-1657B) 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant(s) 



David M. Soderlund, Douglas C. Knipple, and 
Patricia J. Ingles 



Examiner: 
To Be Assigned 



Serial No. 



To Be Assigned (Division of Serial 

No. 08/772,512, filed December 24, 1996) 



Art Unit: 
To Be Assigned 



Filed 



Herewith 



For 



INSECT SODIUM CHANNELS FROM 
INSECTICIDE-SUSCEPTIBLE AND 
INSECTICIDE-RESISTANT HOUSE FLIES 



PRELIMINARY AMENDMENT 



Assistant Commissioner for Patents 
Washington, D.C. 20231 
Box: Patent Application 

Dear Sir: 

Please amend the above-identified patent application as follows: 
In the Specification : 

On page 1, line 8, after "This application is a", insert -divisional application of Serial 
No. 08/772,512, filed on December 24, 1996, which is a-. 

On page 6, line 29, replace " " with -97831--. 

On page 6, line 32, replace " " with - 97832-. 

On page 8, line 23, replace " " with -9783 1-. 

On page 8, line 24, replace " " with -97832--. 

On page 8, line 25, replace "December " with -December 20--. 

On page 28, line 1, replace " " with -9783 1-. 

On page 28, line 4, replace " " with -97832-. 

In the Claims : 

Please cancel claims 1-40 and 53-77, without prejudice. 
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Please amend claim 41, as follows: 

4 1 . (Amended) A method of screening a chemical agent for the ability of the 
chemical agent to modify sodium channel function, said method comprising: 

introducing an isolated nucleic acid molecule encoding a voltage-sensitive 
sodium channel of Musca domestica, wherein said nucleic acid molecule hybridizes 
to a nucleic acid molecule, having a nucleotide sequence according to bases 1 to 101 1 
or 1321 to 5030 of SEP. ID. No. 1 or 3 at 42°. with 5 x SSPC and 50% formamide 
with washi ng at 65° C with 0.5 x SSPC [the nucleic acid molecule of claim 1] into a 
host cell; 

expressing said voltage-sensitive sodium channel encoded by said nucleic acid 
molecule in the host cell so as to result in the functional expression of a voltage- 
sensitive sodium channel in the host cell; 

exposing the host cell to a chemical agent; and 

evaluating the exposed host cell to determine if the chemical agent modifies 
the function of the voltage- sensitive sodium channel. 

Please add new claims 78-83, as follows: 

78. (New) The method according to claim 41, wherein said voltage- 
sensitive sodium channel confers susceptibility to an insecticide in Musca domestica. 

79. (New) The method according to claim 78, wherein said nucleic acid 
molecule has a nucleotide sequence as shown in SEQ ID NO: 1 . 

80. (New) The method according to claim 78, wherein said nucleic acid 
molecule encodes an amino acid sequence as shown in SEQ ID NO:3. 

8 1 . (New) The method according to claim 4 1 , wherein said voltage- 
sensitive sodium channel confers resistance to an insecticide in Musca domestica. 

82. (New) The method according to claim 81, wherein said nucleic acid 
molecule has a nucleotide sequence as shown in SEQ ID NO:2. 



83. (New) The method according to claim 41 , wherein said nucleic acid 
molecule encodes an amino acid sequence as shown in SEQ ID NO:4. 



In view of the above amendments, it is submitted that this case is in condition for 
allowance, and such allowance is earnestly solicited. 



NIXON PEABODY LLP 
Clinton Square, P.O. Box 1051 
Rochester, New York 14603 
Telephone: (716)263-1741 
Facsimile: (716)263-1600 



REMARKS 



Respectfully submitted, 
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INSECT SODIUM CHANNELS FROM INSECTICIDE- SUSCEPTIBLE 
AND INSECTICIDE -RESISTANT HOUSE FLIES 



5 The subject matter of this application was made 

with support from the United States Government under USDA 
Grant No. 94-37302-0408. 

This application is a continuation-in-part of 
U.S. Serial No. 08/608,618, filed March 1, 1996, the 
10 contents of which are hereby incorporated by reference. 

FIELD OF THE INVENTION 

The present invention relates generally to 
insect sodium channel proteins, and more particularly to 
15 insecticide-susceptible and insecticide-resistant voltage- 
sensitive sodium channels of the house fly Musca. 
domes tica. . 

BACKGROUND OF THE INVENTION 

2 0 Throughout this application various 

publications are referenced, many in parenthesis. Full 
citations for these publications are provided at the end 
of the Detailed Description. The disclosures of these 
publications in their entireties are hereby incorporated 
25 by reference in this application. 

Cell membranes must allow passage of various 
polar molecules, including ions, sugars, amino acids, and 
nucleotides. Special membrane proteins are responsible 
for transferring such molecules across cell membranes. 

3 0 These proteins, referred to as membrane transport 

proteins, occur in many forms and in all types of 
biological membranes. Each protein is specific in that it 
transports a particular class of molecules (such as ions, 
sugars, or amino acids) and often only certain molecular 
35 species of the class. All membrane transport proteins 
that have been studied in detail have been found to be 
multipass transmembrane proteins. By forming a continuous 



o 
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protein pathway across the membrane, these proteins enable 
the specific molecules to cross the membrane without 
coming into direct contact with the hydrophobic interior 
of the lipid bilayer of the plasma membrane. 



transport proteins : carrier proteins and channel 
proteins. Carrier proteins bind the specific molecule to 
be transported and undergo a series of conformational 
changes in order to transfer the bound molecule across the 

10 membrane. Channel proteins, on the other hand, need not 
bind the molecule. Instead, they form hydrophilic pores 
that extend across the lipid bilayer; when these pores are 
open, they allow specific molecules (usually inorganic 
ions of appropriate size and charge) to pass through them 

15 and thereby cross the membrane. Transport through channel 
proteins occurs at a much faster rate than transport 
mediated by carrier proteins . 



specifically with inorganic ion transport are referred to 

2 0 as ion channels, and include ion channels for sodium, 

potassium, calcium, and chloride ions. Ion channels which 
open in response to a change in the voltage across the 
membrane are referred to as voltage- sensitive ion 
channels . 

25 The sodium channel is one of the most 

thoroughly characterized of the voltage-sensitive channels 
(see Fig. 1 for a model of a voltage-sensitive sodium 
channel) . In vertebrates, sodium channels in the brain, 
muscle, and other tissues are large ' membrane glycoprotein 

3 0 complexes composed of an alpha subunit (23 0-270 kDa) and 

1-2 tightly associated smaller (33-38 kDa) beta subunits 
(reviewed by Catterall 1992) . The large alpha subunit 
forms the ion permeable pore while the smaller subunits 
play key roles in the regulation of channel function (Isom 
35 et al . 1992; reviewed by Isom et al . 1994) . The alpha 



5 



There are two major classes of membrane 



Channel proteins which are concerned 
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subunit is common to purified channel preparations from 
Electrophorus electricus {electric eel) electric organ 
(Noda et al. 1984), rat brain (Noda et al . 1986), rat 
skeletal muscle (Barchi 1988) and chick heart muscle 
5 (Catterall 1986) . Other studies have revealed the 

existence of multiple closely related isoforms of the 
sodium channel found in different animal species, in 
different tissues within the same species, and even in the 
same tissue (Catterall et al . 1981; Frelin et al . 1984; 

10 Rogart 1986; Moczydlowski et al . 1986). 

The structure of invertebrate sodium channels 
is not as well defined. Gene cloning studies have 
established the existence of alpha subunits of structure 
similar to those described for vertebrates (Loughney et 

15 al . 1989; Ramaswami and Tanouye 1989; Okamoto et al . 

1987) . Analysis of the para behavioral mutant (paralytic; 
Suzuki et al . 1971) of Drosophila melanogaster revealed 
that the para gene encodes a Drosophila sodium channel 
alpha subunit (Loughney et al . 1989) . The entire para 

20 cDNA sequence was determined (Loughney et al . 1989; 
Thackeray and Ganetzky 1994) . 

The kdr mutant of the house fly Musca domes tica 
has also been studied. The kdr insecticide resistance 
trait of the house fly confers reduced neuronal 

25 sensitivity to the rapid paralytic and lethal actions of 
DDT and pyrethroid insecticides (Soderlund and Bloomquist 
1990) . Because these insecticides are known to modify 
neuronal excitability by altering the inactivation 
kinetics of voltage-sensitive sodium channels (Soderlund 

30 and Bloomquist 1989; Bloomquist 1993), efforts to identify 
the molecular basis of kdr resistance have focused on the 
pharmacology and structure of this target . 

Recently, tight genetic linkage between the kdr 
trait and a restriction fragment length polymorphism 

35 located within a segment of the house fly homolog of the 



para gene of Drosophila melanogaster was demonstrated 
(Knipple et al . 1994). Similar linkage studies have also 
documented tight linkage of the super-kdr resistance trait 
of the house fly {Williamson et al . 1993) to molecular 
markers lying within the para-homologous voltage-sensitive 
sodium channel gene. 

Elucidation of the structure of the house fly 
sodium channel gene will enable the screening of potential 
insecticidal agents which act upon the sodium channel . 

A need continues to exist, therefore, for the 
determination of the primary structure of the house fly 
sodium channel, i.e. the nucleotide and amino acid 
sequences of the channel . 

SUMMARY OF INVENTION 

To this end, the subject invention provides the 
6318 nucleotide coding sequence (SEQ I D NO : 1 ) of the 
voltage-sensitive sodium channel gene from insecticide- 
susceptible (NAIDM strain) house flies {Musca domestica.) , 
determined by automated direct DNA sequencing of PCR 
fragments obtained by amplification on first strand cDNA 
from adult heads. The deduced 2105-residue amino acid 
sequence (SEQ ID NO : 3 ) exhibits overall structure and 
organization typical of -sodium channel alpha subunit genes 
and is 90.0% identical to that of the D. melanogaster para 
gene product. There is no evidence for the existence of 
multiple splice variants among voltage-sensitive sodium 
channel cDNAs obtained from adult house fly head 
preparations. Comparison of the coding sequence of the 
voltage-sensitive sodium channel gene of the kdr 
insecticide-resistant house -fly strain (538ge strain) to 
that of the NAIDM strain reveals 12 amino acid differences 
in the 538ge strain. The amino acid sequence (SEQ ID 
NO: 4) of the Kdr strain is only 2104 residues in length, 
as a result of five (5) amino acid substitutions, four (4) 



amino acid deletions, and three (3) amino acid insertions 
as compared to the 2105-residue amino acid sequence (SEQ 
ID NO: 3) of the NAIDM strain. The nucleotide sequence 
(SEQ ID NO:2) of the Kdr strain is therefore 6315 
5 nucleotides in length, which is three nucleotides shorter 
than the nucleotide sequence (SEQ ID NO:l) of the NAIDM 
strain . 

More particularly, the subject invention 
provides an isolated nucleic acid molecule encoding a 

10 voltage-sensitive sodium channel of Musca domestica r 

wherein the voltage-sensitive sodium channel is capable of 
conferring sensitivity or resistance to an insecticide in 
Musca domestica . In one embodiment, the nucleic acid 
molecule confers insecticide susceptibility to the house 

15 fly, and in another embodiment the nucleic acid molecule 
confers insecticide resistance to the house fly. The 
nucleic acid molecule conferring insecticide resistance is 
preferably a mutated form of the nucleic acid molecule 
encoding the insecticide susceptible channel . The 

2 0 invention also provides an antisense nucleic acid molecule 
complementary to mRNA encoding the voltage-sensitive 
sodium channel of Musca domestica. 

The isolated nucleic acid molecules of the 
invention can be inserted into suitable expression vectors 

25 and/or host cells. Expression of the nucleic acid 
molecules encoding the sodium channels results in 
production of functional sodium channels in a host cell. 
Expression of the antisense nucleic acid molecules or 
fragments thereof in a host cell results in decreased 

30 expression of the functional sodium channels. 

The invention further provides a ribozyme 
having a recognition sequence complementary to a portion 
of mRNA encoding a voltage-sensitive sodium channel of 
Musca domestica . The ribozyme can be introduced into a 
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cell to also achieve decreased expression of sodium 
channels in the cell. 

The invention further provides a method of 
screening a chemical agent for the ability of the chemical 
5 agent to modify sodium channel function, and a method of 
obtaining DNA encoding a voltage -sensitive sodium channel 
of Musca domestica . 

Further provided is an isolated nucleic acid 
molecule encoding a voltage-sensitive sodium channel of an 

10 insect, wherein the nucleic acid molecule encodes a first 
amino acid sequence having at least 95% amino acid 
identity to a second amino acid sequence. The second 
amino acid sequence is, in two preferred embodiments, SEQ 
ID NO: 3 or SEQ ID NO : 4 . 

15 The invention also provides an isolated 

voltage- sensitive sodium channel of Musca domestica., and 
antibodies or antibody fragments specific for the sodium 
channel . The antibodies or antibody fragments can be used 
to detect the presence of the sodium channel in samples . 

20 Further provided is an isolated voltage- 

sensitive sodium channel of Musca domestica., wherein the 
voltage-sensitive sodium channel is comprised of a protein 
having a first amino acid sequence with at least 95% amino 
acid identity to a second amino acid sequence. In two 

25 preferred embodiments, the second amino acid sequence is 
SEQ ID NO : 3 or SEQ ID NO : 4 . 

Also provided by the subject invention is a 
plasmid designated pPJIl and deposited with the ATCC under 
Accession No. , as well as a Kpnl/Aatll restriction 

30 fragment of about 3620 bp of the plasmid designated pPJIl . 
Further provided is a plasmid designated pPJI2 and 

deposited with the ATCC under Accession No. , as well 

as an Aatll/Sphll restriction fragment of about 2700 bp of 
the plasmid designated pPJI2 . When the above two 

3 5 restriction fragments are ligated together at their AacII 



sites, the resulting nucleic acid molecule encodes a 
voltage-sensitive sodium channel which confers 
susceptibility to an insecticide in Musca domestica . This 
resulting nucleic acid molecule is also provided by the 
5 subject invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features and advantages of this 
invention will be evident from the following detailed 
10 description of preferred embodiments when read in 

conjunction with the accompanying drawings in which: 

Fig. 1 is a model of a voltage sensitive sodium 
channel from mammalian brain in the plasma membrane. The 
alpha and beta x subunits interact noncovalently ; the alpha 
15 and beta 2 subunits are linked by disulfide bonds. The- 
branched structures at the outer surface of the channel 
represent oligosaccharides; 

Fig. 2 is a diagram of the structural 
organization of the voltage-sensitive sodium channel 

2 0 coding sequence of Musca domestica {Vsscl) showing 

repeated homology domains I -IV and putative transmembrane 
helices (rectangles) . Shown below the structural 
organization are the relative length and location of the 
previously-described 3 09-nucleotide exon of Vsscl (Knipple 
25 et al . 1994) (exon) and seven overlapping PCR-amplif ied 
cDNA fragments (A-G) employed as templates for DNA 
sequencing; 

Fig. 3 shows the alignment of the predicted 
amino acid sequences of Vsscl NAIDM (NAIDM) and Vsscl 5389 * 

3 0 (53 8ge) with that of the a + b~ c~d + e~ f~h~ i + splice variant of 

the D. melanogas ter para sequence (para) obtained using 
the DNASTAR computer program (Clustal method) . Residues 
that are identical to the NAIDM sequence in both 53 8ge and 
para are indicated as dashes (-) in the latter two 
35 sequences; gaps introduced to obtain optimal alignment are 



indicated as periods ( . ) . The locations of 24 putative 
helical transmembrane domains (e.g., IS1, IS2 , etc.) and 
four putative pore -forming domains (e.g., IP, IIP) are 
marked by solid bars above the NAIDM sequence. Also 
5 marked above the NAIDM sequence are possible sites for .re- 
linked glycosylation (#) , cAMP -dependent protein kinase 
phosphorylation (*), and protein kinase C phosphorylation 
(•) ; and 

Fig. 4 is a diagram of the Vsscl gene product 
10 showing the locations of 12 amino acid differences 

identified in the Vsscl 52 * 3 * sequence, including 5 amino 
acid substitutions, 4 amino acid deletions, and 3 amino 
acid insertions in the Vsscl 538ge sequence (R) as compared 
to the Vsscl NAIDM sequence (S) . 

15 

DETAILED DESCRIPTION 

The plasmids designated pPJIl and pPJI2 have 
each been deposited pursuant to, and in satisfaction of, 
the requirements of the Budapest Treaty on the 

20 International Recognition of the Deposit of Microorganisms 
for the Purposes of Patent Procedure, with the American 
Type Culture Collection (ATCC) , 123 01 Parklawn Drive, 

Rockville, Maryland, 20852 under ATCC Accession No. 

(pPJIl) and ATCC Accession No. (pPJI2) . Both 

25 deposits were made on December , 1996. 

As used herein, the term "isolated" when used 
in conjunction with a nucleic acid molecule refers to: 1) 
a nucleic acid molecule which has been separated from an 
organism in a substantially purified form (i.e. 

3 0 substantially free of other substances originating from 
that organism) , or 2) a nucleic acid molecule having the 
same nucleotide sequence but not necessarily separated 
from the organism (i.e. synthesized nucleic acid 
molecules) . The term "isolated" when used in conjunction 

3 5 with a channel refers to a channel encoded by such an 



"isolated" nucleic acid molecule, generally expressed in a 
membrane, such as a plasma membrane within a cell or a 
synthetic lipid bilayer membrane. The expressed 
"isolated" channel has the pharmacological properties of a 
5 functional sodium channel. 

As further used herein, the terms 
"corresponding to" or "having" or "as shown in" when used 
in conjunction with a SEQ ID NO for a nucleotide sequence 
refer to a nucleotide sequence which is substantially the 
10 same nucleotide sequence, or derivatives or equivalents 
thereof (such as deletion and hybrid variants thereof, 
splice variants thereof, etc.) . Nucleotide additions, 
deletions, and/or substitutions, such as those which do 
not affect the translation of the DNA molecule, are within 
15 the scope of a nucleotide sequence corresponding to or 
having or as shown in a particular nucleotide sequence 
(i.e. the amino acid sequence encoded thereby remains the 
same) . Such additions, deletions, and/or substitutions 
can be, for example, point mutations made according to 
20 methods known to those skilled in the art. It is also 

possible to substitute a nucleotide which alters the amino 
acid sequence encoded thereby, where the amino acid 
substituted is a conservative substitution or where amino 
acid homology is conserved. It is also possible to have 
25 minor nucleotide additions, deletions, and/or 

substitutions which do not alter the function of the 
resulting VSSC. Similarly, the term "corresponding to" or 
"having" or "as shown in" when used in conjunction with a 
SEQ ID NO for an amino acid sequence refers to an amino 
3 0 acid sequence which is substantially the same amino acid 
sequence or derivatives or equivalents thereof. Amino 
acid additions, deletions, and/or substitutions which do 
not negate the ability of the resulting protein to form a 
functional sodium channel are within the scope of an amino 
35 acid sequence corresponding to or having or as shown in a 
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particular amino acid sequence. Such additions, 
deletions, and/or substitutions can be, for example, the 
result of point mutations in the DNA encoding the amino 
acid sequence, such point mutations made according to 
5 methods known to those skilled in the art. Substitutions 
may be conservative substitutions of amino acids. As used 
herein, two amino acid residues are conservative 
substitutions of one another where the two residues are of 
the same type. In this regard, for purposes of the 

10 present invention, proline, alanine, glycine, serine, and 
threonine, all of which are neutral, weakly hydrophobic 
residues, are of the same type. Glutamine, glutamic acid, 
asparagine, and aspartic acid, all of which are acidic, 
hydrophilic residues, are of the same type. Another type 

15 of residue is the basic, hydrophilic amino acid residues, 
which include histidine, lysine, and arginine . Leucine, 
isoleucine, valine, and methionine all of which are 
hydrophobic, aliphatic amino acid residues, form yet 
another type of residue. Yet another type of residue 

20 consists of phenylalanine, tyrosine, and tryptophan, all 
of which are hydrophobic, aromatic residues. Further 
descriptions of the concept of conservative substitutions 
are given by French and Robson 19 83, Taylor 1986, and 
Bordo and Argos 19 91. 

2 5 As further used herein, the term "corresponding 

to" or "having" or "as shown in" or "consisting of" when 
used in conjunction with a SEQ ID NO for a nucleotide or 
amino acid sequence is intended to cover linear or cyclic 
versions of the recited sequence (cyclic referring to 

3 0 entirely cyclic versions or versions in which only a 

portion of the molecule is cyclic, including, for example, 
a single amino acid cyclic upon itself) , and is intended 
to cover derivative or modified nucleotides or amino acids 
within the recited sequence. For example, those skilled 
3 5 in the art will readily understand that an adenine 



nucleotide could be replaced with a methyladenine, or a 
cytosine nucleotide could be replaced with a 
methylcytosine, if a methyl side chain is desirable. 
Nucleotide sequences having a given SEQ ID NO are intended 
to encompass nucleotide sequences containing these and 
like derivative or modified nucleotides, as well as cyclic 
variations. As a further example, those skilled in the 
art will readily understand that an asparagine residue 
could be replaced with an ethylasparagine if an ethyl side 
chain is desired, a lysine residue could be replaced with 
a hydroxylysine if an OH side chain is desired, or a 
valine residue could be replaced with a methylvaline if a 
methyl side chain is desired. Amino acid sequences having 
a given SEQ ID NO are intended to encompass amino acid 
sequences containing these and like derivative or modified 
amino acids, as well as cyclic variations. Cyclic, as 
used herein, also refers to cyclic versions of the 
derivative or modified nucleotides and amino acids. 

The function of the encoded sodium channel can 
be assayed according to methods known in the art, such as 
by voltage clamp analysis of the channel following the 
functional expression of the channel in oocytes of the 
frog Xenopus laevis (see Taglialatela et al . 1992 and 
Stuhmer 1992 for a general discussion of the voltage clamp 
analysis of receptors and ion channels expressed in 
Xenopus oocytes) . As used herein, "functional expression" 
refers to the synthesis and any necessary post- 
translational processing of a sodium channel molecule in a 
host cell so that the channel is inserted properly in the 
cell membrane and is capable of conducting sodium ions in 
response to an experimentally- imposed change in the cell 
membrane potential or upon exposure to appropriate 
pharmacological agents. 

As further used herein, "sensitivity" and 
"resistance" refer to the relative responses of 
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genetically-defined insect populations to the paralytic or 
lethal actions of a test insecticide. For example, a dose 
of DDT [1, 1-bis- (4-chlorophenyl) -2 , 2 , 2 - trichloroethane] of 
approximately 0.02 //g per adult fly will kill 
5 approximately 5 0% of the treated individuals of a 

susceptible (Cooper-S) house fly strain, whereas doses of 
approximately 0.5 /xg per adult fly are required to kill 
approximately 50% of the treated individuals of a 
resistant (538ge) house fly strain {Sawicki 1978) . The 

10 absolute doses that define susceptibility and resistance 
vary with the insect species and genetically defined 
populations examined, the test insecticide employed, and 
the method of exposure. In general, an insect strain or 
population is considered "resistant" if it exhibits 

15 tolerance to a test insecticide (assessed as the dose 

required to poison 50% of a treated population or group) 
that is at least 10 times greater than the tolerance of an 
appropriate reference, or "susceptible" population. Test 
insecticides include not only DDT but also analogs of DDT 

20 (e.g., methoxychlor, perthane) and pyrethroid insecticides 
(e.g., deltamethrin, fenvalerate, resmethrin, permethrin) . 

As also used herein, insects include Musca 
domestica (the house fly) , the fruit or vinegar fly 
(Drosophila mela.noga.ster) , and various other insect 

2 5 species of agricultural, medical or veterinary importance, 

such as Heliothis virescens (the tobacco budworm) , 
Leptinotarsa decemlineata (the Colorado potato beetle) , 
Blattella germanica (the German cockroach) , and Aedes 
aegypti (the yellow fever mosquito) '. 

3 0 " The subject invention provides an isolated 

nucleic acid molecule encoding a voltage-sensitive sodium 
channel (VSSC) of Musca domestica, wherein the VSSC is 
capable of conferring sensitivity or resistance to an 
insecticide in Musca domestica . The nucleic acid molecule 
3 5 can be deoxyribonucleic acid (DNA) or ribonucleic acid 
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(RNA, including messenger RNA or mRNA) , genomic or 
recombinant, biologically isolated or synthetic. 

The DNA molecule can be a cDNA molecule, which 
is a DNA copy of a messenger RNA (mRNA) encoding the VSSC. 
5 In one embodiment, the VSSC confers insecticide 

susceptibility to Musca domestics. An example of such an 
insecticide susceptible VSSC is the channel encoded by the 
nucleotide sequence as shown in SEQ ID NO : 1 . SEQ ID NO : 1 
is the DNA sequence of one allele of the VSSC of Musca 

10 domestica. The amino acid sequence encoded by this allele 
is shown in SEQ ID N0:3. 

In another embodiment, the VSSC confers 
insecticide resistance to Musca domestica . An example of 
such an insecticide resistant VSSC is the channel encoded 
.15 by the nucleotide sequence as shown in SEQ ID NO : 2 . SEQ 
ID NO: 2 is the DNA sequence of another allele of the VSSC 
of Musca domestica characteristic of the kdr insecticide 
resistant strain. The amino acid sequence encoded by this 
mutant allele is shown in SEQ ID NO: 4. 

20 The insecticide resistant allele preferably has 

the nucleotide sequence of a second nucleic acid molecule 
with one or more mutations therein, wherein the second 
nucleic acid molecule encodes an insecticide sensitive 
VSSC and wherein one or more mutations in the second 

25 nucleic acid molecule render the resulting VSSC resistant 
to an insecticide (hence the term "mutant" allele) . In 
one embodiment, the mutant allele (having amino acid SEQ 
ID NO: 4) has the amino acid sequence encoded by the 
susceptibility allele (amino acid SEQ ID NO: 3) with amino 

3 0 acid differences as follows: a substitution of 

phenylalanine for leucine at • amino acid residue 1014 of 
SEQ ID NO: 3; a substitution of isoleucine for methionine 
at amino acid residue 1140 of SEQ ID NO : 3 ; a substitution 
of aspartic acid for glycine at amino acid residue 2 023 of 

35 SEQ ID NO: 3; a deletion of amino acid residues 2031-2034 



of SEQ ID NO: 3 (glycine-alanine-threonine-alanine) ; a 
substitution of threonine for serine at amino acid residue 
2042 of SEQ ID NO : 3 ; a substitution of alanine for valine 
at amino acid residue 2054 of SEQ ID NO : 3 ; and an 
insertion of three amino acid residues (asparagine- 
glycine-glycine) after amino acid residue 2055 of SEQ ID 
NO: 3 (between amino acid residues 2055 and 2056 of SEQ ID 
NO: 3) . One or more of these amino acid differences can be 
included in an insecticide resistant VSSC . Other suitable 
sites for mutations can be identified by conventional, 
molecular genetic approaches, such as the identification 
of amino acid sequence substitutions/insertions/deletions 
in the VSSC sequences of other insecticide-resistant house 
fly strains. 

The invention also provides an antisense 
nucleic acid molecule that is complementary to the mRNA 
encoding the VSSC, or a fragment thereof. Antisense 
nucleic acid molecules can be RNA or single-stranded DNA. 
Antisense molecules can be complementary to the entire DNA 
molecule encoding the VSSC, i.e. of the same nucleotide 
length as the entire molecule. It may be desirable, 
however, to work with a shorter molecule. In this 
instance, fragments of the entire antisense molecule can 
be used. Suitable fragments are capable of hybridizing to 
the mRNA encoding the entire molecule, and preferably 
consist of at least twenty nucleotides. These antisense 
molecules and fragments thereof can be used to reduce 
steady state levels of a VSSC gene product of Musca 
domestica, by introducing into cells an RNA or single- 
stranded DNA molecule that is complementary to the mRNA of 
the VSSC (i.e. by introducing an antisense molecule) . The 
antisense molecule can base -pair with the mRNA of the 
VSSC, preventing translation of the mRNA into protein. 
Thus, an antisense molecule to the VSSC of Musca. domestica. 
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can prevent translation of mRNA encoding the VSSC into a 
functional sodium channel protein. 

More particularly, an antisense molecule 
complementary to mRNA encoding a VSSC of Musca domes tica, 
5 or a fragment thereof, can be used to decrease expression 
of a functional VSSC of Musca domestica. A cell with a 
first level of expression of a functional VSSC of Musca 
domestica is first selected, and then the antisense 
molecule (or fragment thereof) is introduced into the 

10 cell. The antisense molecule (or fragment thereof) blocks 
expression of functional VSSCs of Musca domestica , 
resulting in a second level of expression of a functional 
VSSC of Musca domestica in the cell. The second level is 
less than the initial first level. 

15 Antisense molecules can be introduced into 

cells by any suitable means. Suitable cells include 
Xenopus oocytes which are useful host cells for studying 
the expression of the encoded sodium channel, and various 
insect cells, including but not limited to the insect cell 

20 lines Drosophila Schneider (Johansen et al . 1989) , 

Drosophila K c (Sang 1981), Sf9 (Smith et al . 1983), and 
High Five® (see U.S. Patent No. 5,300,435) . In one 
embodiment, the antisense RNA molecule is injected 
directly into the cellular cytoplasm, where the RNA 

25 interferes with translation. A vector may also be used 
for introduction of the antisense molecule into a cell . 
Such vectors include various plasmid and viral vectors. 
For a general discussion of antisense molecules and their 
use, see Han et al . 1991 and Rossi 1995. 

3 0 The invention further provides a special 

category of antisense RNA molecules, known as ribozymes, 
having recognition sequences complementary to specific 
regions of the mRNA encoding the VSSC of Musca domestica . 
Ribozymes not only complex with target sequences via 

35 complementary antisense sequences but also catalyze the 



hydrolysis, or cleavage, of the template mRNA molecule. 
Examples, which are not intended to be limiting, of 
suitable regions of the mRNA template to be targeted by 
ribozymes are any of the regions encoding the 24 putative 
5 transmembrane domains of the VSSC of Musca domestical. 

Expression of a ribozyme in a cell can inhibit 
gene expression (such as the expression of a VSSC of Musca. 
domestica) . More particularly, a ribozyme having a 
recognition sequence complementary to a region of a mRNA 

10 encoding a VSSC of Musca domestica can be used to decrease 
expression of a functional VSSC of Musca domestica . A 
cell with a first level of expression of a functional VSSC 
of Musca domestica is first selected, and then the 
ribozyme is introduced into the cell . The ribozyme in the 

15 cell decreases expression of a functional VSSC of Musca 
domestica in the cell, because mRNA encoding the VSSC is 
cleaved and cannot be translated. 

Ribozymes can be introduced into cells by any 
suitable means. Suitable cells include Xenopus oocytes 

20 which are useful host cells for studying the expression of 
the encoded sodium channel, and various insect cells, 
including but not limited to the insect cell lines 
Drosophila Schneider, Drosophila K c , Sf9, and High Five®. 
In one embodiment, the ribozyme is injected directly inco 

25 the cellular cytoplasm, where the ribozyme cleaves the 
mRNA and thereby interferes with translation. A vector 
may be used for introduction of the ribozyme into a cell . 
Such vectors include various plasmid and viral vectors 
(note that the DNA encoding the ribozyme does not need to 

3 0 be "incorporated" into the genome of the host cell; it 
could be expressed in a host- cell infected by a viral 
vector, with the vector expressing the ribozyme, for 
instance) . For a general discussion of ribozymes and 
their use, see Sarver et al . 1990, Chrisey et al . 1991, 

35 Rossi et al . 1992, and Christof f ersen et al . 1995. 
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The nucleic acid molecules of the subject 
invention can be expressed in suitable host cells using 
conventional techniques. Any suitable host and/or vector 
system can be used to express the VSSCs . These include, 
5 but are not limited to, eukaryotic hosts such as mammalian 
cells (i.e., Hela cells, Cv-1 cells, COS cells), Xenopus 
oocytes, and insect cells (i.e. insect cell lines such as 
Drosophila. Schneider, Drosophila K c , Sf9, and High Five®) . 

Techniques for introducing the nucleic acid 

10 molecules into the host cells may involve the use of 
expression vectors which comprise the nucleic acid 
molecules. These expression vectors (such as plasmids and 
viruses; viruses including bacteriophage) can then be used 
to introduce the nucleic acid molecules into suitable host 

15 cells. For example, sodium channel expression is often 

studied in Xenopus oocytes. DNA encoding the VSSC can be 
injected into the oocyte nucleus or transformed into the 
oocyte using a suitable vector, or mRNA encoding the VSSC 
can be injected directly into the oocyte, in order to 

20 obtain expression of a functional VSSC in the oocyte. It 
may be beneficial when expressing the sodium channels of 
the subject invention in Xenopus oocytes to coexpress a 
nucleic acid molecule encoding a tipE protein (Feng et al . 
1995) . Tip E has been found to be necessary to obtain 

2 5 expression of some sodium channels in Xenopus oocytes 

(Feng et al . 1995) . 

Various methods are known in the art for 
introducing nucleic acid molecules into host cells . One 
method is microinjection, in which DNA is injected 

3 0 directly into the nucleus of cells through fine glass 

needles (or RNA is injected directly into the cytoplasm of 
cells) . Alternatively, DNA can be incubated with an inert 
carbohydrate polymer (dextran) to which a positively 
charged chemical group (DEAE, for diethylaminoethyl ) has 
35 been coupled. The DNA sticks to the DEAE-dextran via its 
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negatively charged phosphate groups. These large DNA- 
containing particles stick in turn to the surfaces of 
cells, which are thought to take them in by a process 
known as endocytosis. Some of the DNA evades destruction 
in the cytoplasm of the cell and escapes to the nucleus, 
where it can be transcribed into RNA like any other gene 
in the cell. In another method, cells efficiently take in 
DNA in the form of a precipitate with calcium phosphate. 
In electroporation, cells are placed in a solution 
containing DNA and subjected to a brief electrical pulse 
that causes holes to open transiently in their membranes. 
DNA enters through the holes directly into the cytoplasm, 
bypassing the endocytotic vesicles through which they pass 
in the DEAE-dextran and calcium phosphate procedures 
(passage through these vesicles may sometimes destroy or 
damage DNA) . DNA can also be incorporated into artificial 
lipid vesicles, liposomes, which fuse with the cell 
membrane, delivering their contents directly into the 
cytoplasm. In an even more direct approach, used 
primarily with plant cells and tissues, DNA is absorbed to 
the surface of tungsten microproj ectiles and fired into 
cells with a device resembling a shotgun. 

Several of these methods, microinjection, 
electroporation, and liposome fusion, have been adapted to 
introduce proteins into cells. For review, see Mannino 
and Gould-Fogerite 1988, Shigekawa and Dower 1988, 
Capecchi 1980, and Klein et al . 1987. 

Further methods for introducing nucleic acid 
molecules into cells involve the use of viral vectors . 
Since viral growth depends on the ability to get the viral 
genome into cells, viruses have devised clever and 
efficient methods for doing it. One such virus widely 
used for protein production is an insect virus, 
baculovirus . Baculovirus attracted the attention of 
researchers because during infection, it produces one of 



its structural proteins (the coat protein) to spectacular 
levels. If a foreign gene were to be substituted for this 
viral gene, it too ought to be produced at high level . 
Baculovirus, like vaccinia, is very large, and therefore 
5 foreign genes must be placed in the viral genome by- 
recombination. To express a foreign gene in baculovirus, 
the gene of interest is cloned in place of the viral coat 
protein gene in a plasmid carrying a small portion of the 
viral genome. The recombinant plasmid is cotransf ected 

10 into insect cells with wild- type baculovirus DNA. At a 
low frequency, the plasmid and viral DNAs recombine 
through homologous sequences, resulting in the insertion 
of the foreign gene into the viral genome. Virus plaques 
develop, and the plaques containing recombinant virus look 

15 different because they lack the coat protein. The plaques 
with recombinant virus are picked and expanded. This 
virus stock is then used to infect a fresh culture of 
insect cells, resulting in high expression of the foreign 
protein. For a review of baculovirus vectors, see Miller 

20 (1989) . Various viral vectors have also been used to 

transform mammalian cells, such as bacteriophage, vaccinia 
virus, adenovirus, and retrovirus. 

As indicated, some of these methods of 
transforming a cell require the use of an intermediate 

25 plasmid vector. U.S. Patent No. 4,237,224 to Cohen and 
Boyer describes the production of expression systems in 
the form of recombinant plasmids using restriction enzyme 
cleavage and ligation with DNA ligase. These recombinant 
plasmids are then introduced by means of transformation 

3 0 and replicated in unicellular cultures including 

procaryotic organisms and eucaryotic cells grown in tissue 
culture. The DNA sequences are cloned into the plasmid 
vector using standard cloning procedures known in the art, 
as described by Sambrook et al . (1989) . 



• ) o 

- 20 - 

Host cells into which the nucleic acid encoding 
the VSSC has been introduced can be used to produce (i.e. 
to functionally express) the voltage-sensitive sodium 
channel . 

Having identified the nucleic acid molecules 
encoding VSSCs and methods for expressing functional 
channels encoded thereby, the invention further provides a 
method of screening a chemical agent for the ability of 
the chemical agent to modify sodium channel function. The 
method comprises introducing a nucleic acid molecule 
encoding the VSSC into a host cell, and expressing the 
VSSC encoded by the molecule in the host cell . The 
expression results in the functional expression of a VSSC 
in the membrane of the host cell . The cell is then 
exposed to a chemical agent and evaluated to determine if 
the chemical agent modifies the function of the VSSC. 
From this evaluation, chemical agents effective in 
altering the function of the sodium channel can be found. 
Such agents may be, for example, tetrodotoxin, 
veratridine, and scorpion venom toxins. Additional agents 
can be found in Soderlund and Knipple 19 94. 

Cells transformed to include the VSSC according 
to the subject invention can be exposed to various 
potential insecticides and pesticides and evaluated for 
their susceptibility to the agents to develop and identify 
insect control agents that will not cause adverse effects 
to vertebrate species. Exemplary methods of screening are 
described in Eldefrawi et al . 1987 and Rauh et al . 1990. 
The evaluation of the function of the sodium channel can 
be by any means known in the art. In one embodiment, the 
evaluation comprises monitoring sodium transport through 
the VSSC. Sodium transport can be monitored by pre- 
incubating cells in a medium containing one or more 
chemical agents, adding a medium containing radiosodium 
( 22 Na + ) , incubating the cells further in this medium, and 



isolating cells by filtration. Sodium transport is 
detected by the measurement of 22 Na* within the cells by 
liquid scintillation counting or other radiometric 
techniques (Bloomquist and Soderlund 1988) . 

Alternatively, [ 14 C] guanidinium ion can be employed as the 
radiotracer in the place of sodium using the same 
procedure (Jacques et al . 1978) . In another embodiment, 
the function of the VSSC can be evaluated by pre- 
incubating cells to equilibrium with a sodium-selective 
fluorescent chelating agent (e.g., SBFI [sodium-binding 
benzofuran isophthalate] ) , washing the cells, exposing the 
cells to a test agent, and monitoring the increase in 
intracellular sodium by measuring the fluorescence of the 
SBFI -sodium complex (Deri and Adam-Vizi 1993) . 

The nucleic acid molecules of the subject 
invention can be used either as probes or for the design 
of primers to obtain DNA encoding other VSSCs by either 
cloning and colony/plaque hybridization or amplification 
using the polymerase chain reaction (PCR) . 

Specific probes derived from SEQ ID NOs 1 or 2 
can be employed to identify colonies or plaques containing 
cloned DNA encoding a member of the VSSC family using 
known methods (see Sambrook et al . 1989) . One skilled in 
the art will recognize that by employing such probes under 
high stringency conditions (for example, hybridization at 
42°C with 5X SSPC and 50% formamide, washing at 50-65°C 
with 0 . 5X SSPC) , sequences having regions which are 
greater than 90% identical to the probe can be obtained. 
Sequences with lower percent identity to the probe, which 
also encode VSSCs, can be obtained by lowering the 
stringency of hybridization and washing (for example, by 
reducing the hybridization and wash temperatures or 
reducing the amount of formamide employed) . 

More particularly, in one embodiment, the 
method comprises selection of a DNA molecule encoding a 



VSSC of an insect, or a fragment thereof, the DNA molecule 
having a nucleotide sequence selected from the group 
consisting of SEQ ID N0:1 and SEQ ID NO : 2 , and designing 
an oligonucleotide probe for a VSSC based on SEQ ID N0:1 
or SEQ ID NO: 2. A genomic or cDNA library of an insect is 
then probed with the oligonucleotide probe, and clones are 
obtained from the library that are recognized by the 
oligonucleotide probe so as to obtain DNA encoding another 
VSSC . 

Specific primers derived from SEQ ID NOs 1 or 2 
can be used in PCR to amplify a DNA sequence encoding a 
member of the VSSC family using known methods (see Innis 
et al . 1990) . One skilled in the art will recognize that 
by employing such primers under high stringency conditions 
(for example, annealing at 50-60°C, depending on the 
length and specific nucleotide content of the primers 
employed) , sequences having regions greater than 75% 
identical to the primers will be amplified. 

More particularly, in a further embodiment the 
method comprises selection of a DNA molecule encoding a 
' VSSC of an insect, or a fragment thereof, the DNA molecule 
having a nucleotide sequence selected from the group 
consisting of SEQ ID NO : 1 and SEQ ID NO : 2 , designing 
degenerate oligonucleotide primers based on regions of SEQ 
ID NO : 1 or SEQ ID NO : 2 , and employing such primers in the 
polymerase chain reaction using as a template a DNA sample 
to be screened for the presence of VSSC-encoding 
sequences. The resulting PCR products can be isolated and 
sequenced to identify DNA fragments that encode 
polypeptide sequences corresponding to the targeted region 
of a VSSC. 

Various modifications of the nucleic acid and 
amino acid sequences disclosed herein are covered by the 
subject invention. These varied sequences still encode a 
functional VSSC. The invention thus further provides an 



isolated nucleic acid molecule encoding a VSSC of an 
insect, the nucleic acid molecule encoding a first amino 
acid sequence having at least 95% amino acid identity to a 
second amino acid sequence, the second amino acid sequence 
being as shown in SEQ ID N0:3. The resulting encoded VSSC 
is susceptible to an insecticide. The invention also 
provides an isolated nucleic acid molecule encoding a VSSC 
of an insect, the nucleic acid molecule encoding a first 
amino acid sequence having at least 95% amino acid 
identity to a second amino acid sequence, the second amino 
acid sequence being as shown in SEQ ID NO : 4 . The 
resulting VSSC is resistant to an insecticide. 

The invention further provides isolated 
voltage -sensitive sodium channels of Musca domestica, 
wherein the VSSC is capable of conferring sensitivity or 
resistance to an insecticide in Musca domestica. In one 
embodiment, the VSSC confers susceptibility to an 
insecticide in Musca domestica, such as the VSSC encoded 
by the nucleotide sequence as shown in SEQ ID NO : 1 (which 
encodes an amino acid sequence as shown in SEQ ID NO : 3 ) . 
In a further embodiment, the VSSC confers resistance to an 
insecticide in Musca domestica, such as the VSSC encoded 
by the nucleotide sequence as shown in SEQ ID NO : 3 (which 
encodes an amino acid sequence as shown in SEQ ID NO : 4 ) . 
Preferably, the insecticide resistant VSSC is encoded by a 
nucleic acid molecule having the nucleotide sequence of a 
second nucleic acid molecule with one or more mutations 
therein, wherein the second nucleic acid molecule encodes 
an insecticide sensitive VSSC, and wherein the one or more 
mutations in the second nucleic acid molecule render the 
resulting voltage-sensitive 'sodium channel resistant to an 
insecticide. For example, the nucleotide sequence of the 
second nucleic acid molecule may encode amino acid SEQ ID 
NO: 3, and the insecticide resistant VSSC may have that 
-amino acid sequence with one or more differences therein 
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as follows: a substitution of phenylalanine for leucine 
at amino acid residue 1014 of SEQ ID NO : 3 ; a substitution 
of isoleucine for methionine at amino acid residue 1140 of 
SEQ ID NO : 3 ; a substitution of aspartic acid for glycine 
5 at amino acid residue 2023 of SEQ ID NO: 3; a deletion of 
amino acid residues 2031-2034 of SEQ ID NO : 3 (glycine- 
alanine- threonine-alanine) ; a substitution of threonine 
for serine at amino acid residue 2042 of SEQ ID NO : 3 ; a 
substitution of alanine for valine at amino acid residue 

10 2054 of SEQ ID NO : 3 ; and an insertion of three amino acid 
residues (asparagine-glycine-glycine) after amino acid 
residue 2055 of SEQ ID NO : 3 (between amino acid residues 
2055 and 2056 of SEQ ID NO : 3 ) . 

A variety of methodologies known in the art can 

15 be utilized to obtain an isolated VSSC according to the 

subject invention. In one method, the channel protein is 
purified from tissues or cells which naturally produce the 
channel protein. One skilled in the art can readily 
follow known methods for isolating proteins in order to 

2 0 obtain a member of the VSSC protein family, free of 

natural contaminants. These include, but are not limited 
to, immunochromatography, HPLC, size-exclusion 
chromatography, ion-exchange chromatography, and 
immunoaf f inity chromatography. In another embodiment, a 

25 member of the VSSC family can be purified from cells which 
have been altered to express the channel protein'. As used 
herein, a cell is said to be "altered to express the 
channel protein" when the cell, through genetic 
manipulation, is made to produce the channel protein which 

30 it normally does not produce or which the cell normally 
produces at low levels. One -skilled in the art can 
readily adapt procedures for introducing and expressing 
either genomic, cDNA or synthetic sequences into either 
eukaryotic or prokaryotic cells in order to generate a 



cell which produces a member of the VSSC family utilizing 
the sequences disclosed herein. 

A VSSC as defined herein includes molecules 
encoding VSSCs encoded by an amino acid sequence having at 
5 least 95% amino acid identity to SEQ ID NO : 3 or to SEQ ID 
NO: 4 . 

Antibodies can be raised to the voltage- 
sensitive sodium channel. Antibodies of the subject 
invention include polyclonal antibodies and monoclonal 

10 antibodies capable of binding to the channel protein, as 
well as fragments of these antibodies, and humanized 
forms. Humanized forms of the antibodies of the subject 
invention may be generated using one of the procedures 
known in the art such as chimerization . Fragments of the 

15 antibodies of the present invention include, but are not 
limited to, the Fab, the Fab2 , and the Fd fragments. 

The invention also provides hybridomas which 
are capable of producing the above-described antibodies. 
A hybridoma is an immortalized cell line which is capable 

20 of secreting a specific monoclonal antibody. 

In general, techniques for preparing polyclonal 
and monoclonal antibodies as well as hybridomas capable of 
producing the desired antibody are well known in the art 
(see Campbell 1984 and St. Groth et al . 1980). Any animal 

25 (mouse, rabbit, etc.) which is known to produce antibodies 
can be immunized with the antigenic channel protein (or an 
antigenic fragment thereof) . Methods for immunization are 
well known in the art. Such methods include subcutaneous 
or intraperitoneal injection of the protein. One skilled 

3 0 in the art will recognize that the amount of the channel 
protein used for immunization will vary based on the 
animal which is immunized, the antigenicity of the 
protein, and the site of injection. 

The protein which is used as an immunogen may 

3 5 be modified or administered in an adjuvant in order to 



increase the protein's antigenicity. Methods of 
increasing the antigenicity of a protein are well known in 
the art and include, but are not limited to, coupling the 
antigen with a heterologous protein (such as a globulin or 
5 beta-galactosidase) or through the inclusion of an 
adjuvant during immunization. 

For monoclonal antibodies, spleen cells from 
the immunized animals are removed, fused with myeloma 
cells, such as SP2/0-Ag 15 myeloma cells, and allowed to 

10 become monoclonal antibody producing hybridoma cells. 

Any one of a number of methods well known in 
the art can be used to identify the hybridoma cell which 
produces an antibody with the desired characteristics. 
These include screening the hybridomas with an ELISA 

15 assay, western blot analysis, or radioimmunoassay (Lutz et 
al . 1988) . 

Hybridomas secreting the desired antibodies are 
cloned and the class and subclass are determined using 
procedures known in the art (Campbell 19 84) . 

2 0 For polyclonal antibodies, antibody containing 

antisera is isolated from the immunized animal and is 
screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures . 

The present invention further provides the 
25 above -described antibodies in detectably labeled form. 
Antibodies can be detectably labeled through the use of 
radioisotopes, affinity labels (such as biotin, avidin, 
etc.), enzymatic labels (such as horseradish peroxidase, 
alkaline phosphatase, etc.), fluorescent labels (such as 

3 0 FITC or rhodamine, etc.), paramagnetic atoms, etc. 

Procedures for accomplishing -such labeling are well known 
in the art, for example see Sternberger et al . 1970, Bayer 
et al . 1979, Engval et al . 1972, and Goding 1976. 

The labeled antibodies or fragments thereof of 
3 5 the present invention can be used for in vitro, in vivo, 
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and in situ assays to identify cells or tissues which 
express a VSSC, to identify samples containing the VSSC 
proteins, or to detect the presence of a VSSC in a sample. 
More particularly, the antibodies or fragments thereof can 
thus be used to detect the presence of a VSSC in a sample, 
by contacting the sample with the antibody or fragment 
thereof. The antibody or fragment thereof binds to any 
VSSC present in the sample, forming a complex therewith. 
The complex can then be detected, thereby detecting the 
presence of the VSSC in the sample. 

Fragments of the nucleic acid molecules 
encoding a VSSC are also provided, and are best defined in 
the context of amino acid sequence relationships among 
members of the VSSC sequence family and information on the 
function of specific VSSC domains. For example the amino 
acid sequence encoded by nucleotides 4648-4803 of SEQ ID 
NOs 1 or 2 encodes an amino acid sequence that is highly 
conserved among VSSC family members and is identified as 
the structural component forming the " inactivation gate" 
of sodium channels. Antibodies prepared to the 
polypeptide encoded by this fragment would therefore be 
expected to be of use as reagents capable of detecting 
many members of the VSSC family. Such antibodies, if 
introduced into cells that express VSSCs, would also be 
expected to modify the normal function of the VSSCs 
expressed in those cells. In contrast, the amino acid 
sequence encoded by nucleotides 3079-3852 of SEQ ID NOs 1 
or 2 encodes an amino acid sequence that is less well 
conserved between the VSSCs of the insects Musca domes tica 
and Drosophila melanogaster . Antibodies prepared to the 
polypeptide encoded by this fragment would therefore be 
expected to recognize selectively the VSSC from which the 
fragment was derived. 

Also provided by the subject invention is a 
plasmid designated pPJIl and deposited with the ATCC under 



Accession No. , as well as a Kpnl/Aatll restriction 

fragment of about 3620 bp of the plasmid designated pPJIl . 
Further provided is a plasmid designated pPJI2 and 

deposited with the ATCC under Accession No. , as well 

5 as an Aatll/Sphll restriction fragment of about 2700 bp of 
the plasmid designated pPJI2 . When the above two 
restriction fragments are ligated together at their Aatll 
sites, the resulting nucleic acid molecule encodes a 
voltage-sensitive sodium channel which confers 
10 susceptibility to an insecticide in Musca domestical . This 
resulting nucleic acid molecule is also provided by the 
subject invention. 

MATERIALS AND METHODS 

15 Heads of newly-emerged adult house flies (NAIDM 

or 538ge strain) (Knipple et al . 1994) were ground to a 
fine powder under liquid N 2 and extracted with acid 
guanidinium isothiocyanate/phenol/chlorof orm to obtain 
total RNA (Chomczynski and Sacchi 1987) , which was 

20 fractionated on oligo (dT) -paramagnetic beads (PolyATtract 
mRNA isolation system; Promega, Madison, WI) to obtain 
poly(A + ) RNA. Pools of first strand cDNA were synthesized 
using either random hexamers (Harvey and Darlison 1991) or 
oligo (dT) adapted for the 3 ' -RACE procedure (Frohman and 

25 Martin 1989) . These cDNA pools were employed as templates 
in the polymerase chain reaction (PCR) (Saiki et al . 1988) 
to amplify overlapping cDNA segments spanning the entire 
Vsscl coding sequence. Mixed- sequence oligonucleotide 
primers employed for these amplifications comprised all 

3 0 possible sequence combinations encoding short (i.e., 6-8 
residues) regions of amino acid conservation between the 
para gene of D. mela.noga.ster and rat brain sodium channel 
I (Loughney et al . 1989; Knipple et al . 1991) . In a few 
cases, mixed- sequence primers were based solely on the D. 

35 melanogaster sequence. Defined- sequence primers were 
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derived either from the previously described 3 09- 
nucleotide exon of the house fly Vsscl gene (Knipple et 
al . 19 94) or from internal sequences of house fly cDNA 
fragments obtained by amplification with mixed- sequence 
5 primers. All primers were synthesized using an Applied 
Biosystems 3 92 instrument, deprotected using procedures 
provided by Applied Biosystems, desalted, and used without 
further purification. The sequences and designations of 
these primers are given in Table I . The methods and 

10 reagents employed in PCR amplifications are described 
elsewhere (Knipple et al . 1991; Henderson et al . 1994; 
Knipple et al . 1994); specific amplification conditions 
for each cDNA fragment were optimized by varying the 
annealing temperatures and extension times of the 

15 reaction. Following amplification, PCR products were 

separated from excess primers either by filtration of the 
reaction mixture through a Centricon-10 0 concentrator 
(Amicon, Beverly, MA) or by preparative electrophoresis on 
agarose gels, excision of the desired product, and 

20 extraction from the gel matrix (QIAquick spin column; 

Qiagen, Chatsworth, CA) prior to use as templates for DNA 
sequencing . 

The DNA sequences of amplified cDNA fragments 
were determined by automated sequencing with an Applied 

25 Biosystems 373 instrument using f luorescently-labeled 

dideoxynucleotides and Taq DNA polymerase (PCR/Sequencing 
Kit; Applied Biosystems, Foster City, CA) in a 
modification of the dideoxynucleotide chain-termination 
method (Sanger et al . 1977). Sequencing of each 

3 0 amplification product was initiated by using the 

amplification primers to sequence inward from the termini, 
and additional primers were synthesized as needed to 
obtain the complete sequence of each strand. Mixed- 
sequence amplification primers were employed for 

3 5 sequencing at concentrations 10 -fold higher than that used 



' ) 

- 30 - 

for def ined-sequence primers. All sequence ambiguities 
and apparent polymorphisms were resolved by performing 
additional multiple sequencing reactions. The full-length 
Vsscl coding sequences from the NAIDM and 53 8ge strains 
5 were compiled from 239 and 209 individual sequencing 

reactions, respectively, and were edited using the SeqEd 
software program (Applied Biosystems) . Complete house fly 
Vsscl sequences were analyzed and compared with published 
sodium channel sequences using the DNASTAR software 
10 package (DNASTAR, Madison, WI) . 

EXAMPLE I 

SEQUENCING OF THE INSECTICIDE 
15 ' SENSITIVE VSSC OF HOUSE FLY 

As an expedient alternative to conventional 
iterative screenings of cDNA libraries, a sequencing 
strategy for the house fly Vsscl gene was based on the PCR 

20 amplification and direct automated sequencing of 

overlapping cDNA fragments (Fig. 2) . The point of entry 
for this strategy was the 309-nucleotide exon of the house 
fly Vsscl gene identified previously from sequencing of 
cloned genomic DNA (Knipple et al . 19 94) . The use of 

25 def ined-sequence primers from this region (Table I, Al or 
B2) in combination with mixed- sequence primers encoding 
conserved amino acid sequences in either region IIS3 (A2) 
or the extracellular N- terminal domain (Bl) gave cDNA 
fragments A and B. A second point of entry was established 

3 0 in homology domain IV using a pair of mixed- sequence 

primers (CI and C2) to obtain fragment C. A primer (D2) 
designed from the internal sequence of fragment C, 
together with a mixed- sequence primer (Dl) encoding a 
conserved amino acid motif in the short linker between 

3 5 homology domains III and IV, gave fragment D. A pair of 



- 31 - 



defined- sequence primers (El, E2) based on internal 
sequences of fragments A and D gave the large fragment E, 
which spanned most of homology domain II and all of 
homology domain III. Fragment F, corresponding to the 5' 
5 end of the coding sequence, was obtained using a defined- 
sequence primer (F2) derived from the internal sequence of 
fragment B and a mixed- sequence primer (Fl) derived from a 
segment of the D. mela.noga.ster sequence upstream from the 
translation start site (Loughney et al . 1989) . Similarly, 

10 fragment G, containing the 3' end of the coding sequence, 
was obtained using a defined- sequence primer (Gl) derived 
from the internal sequence of fragment C and a mixed- 
sequence primer (G2) derived from a segment of the D. 
melanogaster sequence downstream from the stop codon 

15 (Thackeray and Ganetzky 1994) . 

The complete coding sequence of the Vsscl NAIDM 
allele of the house fly, comprising a single open reading 
frame of 6318 nucleotides (SEQ ID N0:1), was determined by 
automated DNA sequencing using cDNA fragments A - G as 

2 0 templates (Fig. 2) . This cDNA coded for a 2 105 -amino acid 
polypeptide (SEQ ID NO: 3) with a predicted molecular 
weight of 236, 671 Daltons that exhibited all of the common 
structural landmarks found in sodium channel a subunit 
genes (Catterall 1992; Kallen et al . 1993) (see Fig. 3) , 

2 5 including four large internally homologous subdoraains (I- 

IV) , each containing six hydrophobic putative 
transmembrane helices (S1-S6) and a conserved sequence 
element between domains S5 and S6 identified as an ion 
pore-forming domain. The deduced Vsscl NAIDM amino acid 

3 0 sequence also contained a conserved element in the S4 

region of each homology domain, characterized by a 
repeated motif of positively-charged amino acids that are 
thought to form the voltage-sensing element of the 
channel, and a short segment of conserved sequence between 
35 homology domains III and IV that has been identified as 



the channel inactivation gate (see Fig. 3) . The deduced 
Vjssc2 naidm protein contained 10 potential sites for N- linked 
glycosylation (Kornfeld and Kornfeld 1985) , 6 of which 
occur in putative extracellular regions . These regions of 
5 other sodium channel a subunit sequences are also known to 
contain potential glycosylation sites (Catterall 1992; 
Kallen et al . 1993). 

Vertebrate sodium channels are known to undergo 
functional regulation as the result of phosphorylation by 

10 cAMP- dependent protein kinases at sites in the 

intracellular linker between homology domains I and II and 
by protein kinase C at a site in the intracellular linker 
between homology domains III and IV (Catterall 1992; 
Kallen et al . 1993) . The deduced Vsscl NAIDM protein 

15 contained three potential cAMP- dependent protein kinase 
phosphorylation sites (Kemp and Pearson 1990) (Ser54 0, 
Ser557, and Ser628) in the cytoplasmic linker between 
homology domains I and II. The location of two of these 
(Ser540 and Ser557 of SEQ ID NO: 3) corresponded to the 

20 cluster of four sites found in this region of vertebrate 
brain sodium channels that are implicated in sodium 
channel regulation (Catterall 1992) . The deduced Vsscl NAIDM 
protein also contained three additional potential 
phosphorylation sites (Serll67, Serl207, and Ser2097 of 

25 SEQ ID NO: 3) in other putative intracellular domains. The 
role of these phosphorylation sites in the regulation of 
insect sodium channels by cAMP- dependent protein kinase is 
not known. The deduced house fly voltage-sensitive sodium 
channel protein also contained two potential sites for 

30 protein kinase C phosphorylation (Serll91 and Serl582 of 
SEQ ID N0:3) (Kemp and Pearson 1990), the latter of which 
is the conserved site located within the inactivation gate 
sequence of the cytoplasmic linker between domains III and 
IV. Although the conservation of this site implicates a 

35 role for protein kinase C in the regulation of insect 



sodium channels, such an effect has not been demonstrated 
experimentally. 

The deduced Vsscl™ 1 ™ protein was 90.0% 
identical to the most similar variant of the para gene 
5 product of D. melanogaster (SEQ ID NO: 19) (Loughney et al . 
198 9; Thackeray and Ganetzky 1994) (Fig. 3) . The level of 
sequence identity was highest (s>95%) in the N- terminal 
intracellular domain, the linker between homology domains 
III and IV, and homology domain IV. The level of sequence 

10 identity was lowest (73%) in the intracellular C-terrainal 
domain. Alignment of the Vsscl sequence with 12 other 
sodium channel a subunit sequences found in the GenBank 
database showed that the Vsscl and para, gene products 
exhibited approximately the same degree of sequence 

15 similarity as homologous sodium channel or subunit isoforms 
from different vertebrate species. These findings confirm 
and extend previous observations (Williamson et al . 1993; 
Knipple et al . 1994), based on fragmentary genomic DNA and 
cDNA sequences, of the high degree of sequence similarity 

20 between this house fly gene and the' para gene of D. 

melanogaster and reinforce the conclusion that Vsscl is 
the homolog of para in the house fly. 

In D. melanogaster (Thackeray and Ganetzky 
1994; O'Dowd et al . 1995) and Drosophila virills 

25 (Thackeray and Ganetzky 1995) , multiple sodium channel a 
subunit variants, each under specific developmental 
regulation, are generated from the para gene by the 
alternative usage of 8 exons (designated a-f, h, and i) 
located in homology domain II and portions of the 

3 0 cytoplasmic linker regions on either side of this domain. 
Given the heterogeneity of sodium channel -encoding 
sequences found in these Dipteran species, it was 
surprising to detect only a single sequence variant among 
the pool of amplified house fly head cDNA fragments. The 

3 5 Vssc2 NAIDM sequence contained segments identical to exon a 
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and homologous (21 identical amino acids out of 24) to 
exon i of. D. melanogaster . Recent studies suggest that 
both of these exons are required for the expression of 
high sodium current densities in embryonic D. melanogaster 
5 neurons (O'Dowd et al. 19 95) . In the region encoded by 
either exon c or exon d, the house fly sequence differs 
from both D. melanogaster sequences but is slightly more 
similar to exon d (50 identical amino acids out of 55) 
than to exon c (49 identical amino acids out of 55) . The 

10 house fly sequence lacked segments homologous to D. 

melanogaster exons b, e, and f but contained a segment 
identical to exon h, which is a variable element found in 
some D. virilis sequences but not detected in D. 
melanogaster. The house fly Vsscl NAIDM sequence described 

15 is thus characterized as structurally homologous to the 
a*b'c'cTe'f'hM + splice variant of D . melanogaster and D . 
virilis. The identification of this molecular form as the 
predominant sodium channel sequence variant in house fly 
heads was unexpected because it has not been detected 

2 0 among the arrays of splice variants detected in whole 

embryos or whole adults of either D. melanogaster or D. 
virilis . 

EXAMPLE II 

25 

SEQUENCING OF THE INSECTICIDE 
RESISTANT VSSC OF HOUSE FLY 

The PCR amplification/ sequencing strategy 

3 0 summarized in Fig. 2 was also employed to determine the 

sequence of Vsscl cDNAs from -heads of the 538ge house fly 
strain that carries the kdr trait. The nucleotide 
sequence of the VSSC of the 53 8ge house fly is shown in 
SEQ ID NO: 2, and the amino acid sequence is shown in SEQ 
35 ID NO: 4. The amino acid sequence of 2104 residues (SEQ ID 
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N0:4) encoded by the Vsscl 53Sge cDNA contained 12 amino acid 
differences compared to that of the Vsscl KAIDM sequence (SEQ 
ID NO: 3) as follows: a substitution of phenylalanine for 
leucine at amino acid residue 1014 of SEQ ID NO : 3 ; a 
5 substitution of isoleucine for methionine at amino acid 
residue 114 0 of SEQ ID NO: 3; a substitution of aspartic 
acid for glycine at amino acid residue 2 023 of SEQ ID 
NO: 3; a deletion of amino acid residues 2031-2034 of SEQ 
ID NO:3 (glycine-alanine-threonine-alanine); a 

10 substitution of threonine for serine at amino acid residue 
2042 of SEQ ID NO: 3; a substitution of alanine for valine 
at amino acid residue 2054 of SEQ ID NO: 3; and an 
insertion of three amino acid residues (asparagine- 
glycine-glycine) after amino acid residue 2055 of SEQ ID 

15 NO:3 (between amino acid residues 2055 and 2056' of SEQ ID 
NO:3) . A comparison of the Vsscl 5283e (SEQ ID NO : 4 ) and 
Vsscl NAIDM (SEQ ID NO: 3) amino acid sequences to the para 
sequence of the Canton-S strain of D. melanogas ter (SEQ ID 
NO: 19) is shown in Fig. 3. The locations and amino acid 

2 0 sequence context of the differences are shown in Fig. 4. 

In Fig. 4, S refers to the NAIDM amino acid sequence (SEQ 
ID NO: 3), and R refers to the kdr sequence (SEQ ID NO : 4 ) . 
Dashes indicate that the Kdr sequence has the identical 
residue at that position as does the NAIDM sequence. The 

25 difference labeled 1 shows amino acids 1009-1019 of SEQ ID 
NO: 3, with the amino acid substitution at residue 1014 
shown. The difference labeled 2 shows amino acids 113 5- 
1145 of SEQ ID NO: 3, with the amino acid substitution at 
residue 114 0 shown. The difference labeled 3 shows amino 

30 acids 2018-2028 of SEQ ID NO : 3 , with the amino acid 
substitution at residue 2023- shown. The difference 
labeled 4 shows amino acids 2027-2038 of SEQ ID NO : 3 , with 
the deletion of residues 2031-2034 shown. The difference 
labeled 5 shows amino acids 2037-2047 of SEQ ID NO : 3 , with 

3 5 the amino acid substitution at residue 2 042 shown. The 



difference labeled 6 shows amino acids 2051-2059 of SEQ ID 
NO:3, with the amino acid substitution at residue 2054 
shown and the insertion of three residues between 2 055 and 
2056 shown. 

5 

Although preferred embodiments have been 
depicted and described in detail herein, it will be 
apparent to those skilled in the relevant art that various 
modifications, additions, substitutions and the like can 
10 be made without departing from the spirit of the invention 
and these are therefore considered to be within the scope 
of the invention as defined in the claims which follow. 

Table 1. Names and sequences of oligonucleotide primers 



15 


used 


in 


the PCR amplification of partial 


Vsscl 


cDNAs . 




Name 




Sequence 












Al 


5' 


-CGGTTGGGCTTTCCTGTC-3 ' 


SEQ 


ID 


NO 


5 


20 


A2 • 


5' 


- GGGAATTCRAADATRTTCCANCCYTC - 3 ' 


SEQ 


ID 


NO 


6 




Bl 


5' 


-CCCGARGAYATHGAYCYNTAYTA-3 ' 


SEQ 


ID 


NO 


7 




B2 


5' 


-CGTATCGCCTCCTCCTCG-3 ' 


SEQ 


ID 


NO 


8 




CI 


5' 


-GGGTCTAGATHTTYGCNATHTTYGGNATG ' 3 


SEQ 


ID 


NO 


9 




C2 


5' 


- GGGGAATTCNGGRTCRAAYTGYTGCCA- 3 ' 


SEQ 


ID 


NO 


10 


25 


Dl 


5' 


- GGGTCTAGARGANCARAARAARTAYTA- 3 ' 


SEQ 


ID 


NO 


11 




D2 


5' 


-TCATACTTTGGCCCAATGTC-3 ' 


SEQ 


ID 


NO 


12 




El 


5' 


- CCCGAATTAGAGAAGGTGCTG- 3 ' 


SEQ 


ID 


NO 


13 




E2 


5' 


-ACTATTGCTTGTGGTCGCCAC- 3 ' 


SEQ 


ID 


NO 


14 




Fl 


5 ' 


-CATCNTTRGCNGCNTAGACNATGAC-3 ' 


SEQ 


ID 


NO 


15 


30 


F2 


5' 


-GATTGAATGGATCGAGCAGCC-3 ' 


SEQ 


ID 


NO 


16 




Gl 


5' 


-CGTTTCTCCTTTCATATCTAG- 3 ' 


SEQ 


ID 


NO 


17 




G2 


5' 


-GGAGBGGBGGNCKBGGNCKNGCTCA-3 ' 


SEQ 


ID 


NO 


18 



Designation of oligonucleotide mixtures: B=G+T+C; 
3 5 D=G+A+T ; H=A+T+C; K=G+T ; N=A+C+G+T; R=A+G; Y=C+T . 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: Soderlund, David M. 

Knipple, Douglas C. 
Ingles, Patricia J. 

(ii) TITLE OF INVENTION: INSECT SODIUM CHANNELS FROM 

INSECTICIDE-SUSCEPTIBLE AND INSECTICIDE-RESISTANT 
HOUSE FLIES 

(iii) NUMBER OF SEQUENCES: 19 

(iv) CORRESPONDENCE ADDRESS: 
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(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 6318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



ATGACAGAAG 


ATTCCGACTC 


GATATCTGAG 


GAAGAACGCA 


GTTTGTTCCG 


TCCCTTCACC 


60 


CGCGAATCAT 


TGTTACAAAT 


CGAACAACGT 


ATCGCTGAAC 


ATGAAAAACA 


AAAGGAGCTG 


120 


QAAAGAAAGA 


GAGCCGCCGA 


AGGAGAGCAG 


ATACGATATG 


ATGACGAGGA 


CGAAGATGAA 


180 


(jSTCCACAGC 


CGGATCCCAC 


ACTTGAACAG 


GGTGTGCCTA 


TACCTGTTCG 


AATGCAGGGC 


240 


4-bCTTCCCGC 


CGGAATTGGC 


CTCCACTCCT 


CTCGAGGATA 


TCGATCCCTT 


CTACAGTAAT 


300 


GTACTGACAT 


TTGTAGTAAT 


AAGTAAAGGA 


AAGGATATTT 


TTCGTTTTTC 


TGCCTCAAAA 


360 


GCAATGTGGC 


TGCTCGATCC 


ATTCAATCCG 


ATACGTCGTG 


TAGCCATTTA 


TATTTTAGTG 


420 


(fATCCCTTGT 


TTTCGTTATT 


CATTATCACC 


ACTATTCTAA 


CTAAT TGT AT 


TTTAATGATA 


480 


ATGCCGACAA 


CGCCCACGGT 


CGAATCCACA 


GAGGTGATAT 


TCACCGGAAT 


CTACACATTT 


540 


GAATCAGCTG 


TTAAAGTGAT 


GGCACGAGGT 


TTCATTTTAT 


GCCCGTTTAC 


GTATCTTAGA 


600 


GATGCATGGA 


ATTGGCTGGA 


CTTCGTAGTA 


ATAGCTTTAG 


CTTATGTGAC 


CATGGGCATA 


660 


GATTTAGGTA 


ATCTCGCAGC 


TTTGAGAACA 


TTTAGGGTAC 


TGCGAGCTCT 


GAAAACCGTA 


720 


GCCATTGTGC 


CAGGTCTAAA 


AACCATTGTC 


GGTGCTGTCA 


TTGAATCTGT 


AAAAAATCTA 


780 


CGCGATGTGA 


TAATTTTGAC 


AATGTTTTCC 


CTGTCGGTGT 


TCGCGCTGAT 


GGGCCTACAA 


840 


ATCTATATGG 


GTGTTCTAAC 


ACAAAAGTGC 


ATTAAACGAT 


TCCCCCTGGA 


CGGCAGTTGG 


900 


GGCAATCTGA 


CCGATGAAAA 


CTGGTTTCTA 


CACAATAGCA 


ACAGTTCCAA 


TTGGTTTACG 


960 


GAGAACGATG 


GCGAGTCATA 


TCCGGTGTGC 


GGGAATGTAT 


CCGGTGCGGG 


ACAATGCGGC 


1020 


GAGGATTACG 


TCTGCCTGCA 


GGGCTTCGGC 


CCCAATCCCA 


ACTACGACTA 


CACCAGTTTC 


1080 


GATTCATTCG 


GTTGGGCTTT 


CCTGTCGGCG 


TTTCGTCTCA 


TGACCCAAGA 


TTTCTGGGAG 


1140 


GATCTGTATC 


AGCACGTGCT 


GCAAGCAGCT 


GGACCCTGGC 


ACATGTTGTT 


CTTTATAGTC 


1200 
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ATCATCTTCC TAGGTTCATT CTATCTTGTG AATTTGATTT TGGCCATTGT TGCCATGTCT 12 60 

TATGACGAAT TGCAAAAGAA GGCCGAAGAA GAAGAGGCTG CCGAGGAGGA GGCGATACGA 13 2 0 

GAAGCTGAAG AAGCGGCAGC AGCCAAGGCG GCCAAACTGG AGGAGCGGGC CAATGTAGCA 13 80 

GCTCAAGCGG CTCAGGATGC AGCGGATGCC GCTGCGGCAG CTCTGCATCC CGAGATGGCA 144 0 

AAGAGTCCCA CGTACTCTTG CATTAGCTAT GAACTGTTTG TTGGCGGCGA GAAGGGCAAC 15 00 

GATGACAACA ACAAAGAGAA GATGTCCATA CGCAGCGTCG AAGTGGAATC GGAGTCGGTG 15 6 0 

AGCGTTATAC AAAGACAACC AGCACCTACC ACAGCACCCG CTACTAAAGT CCGTAAAGTT 162 0 

AGCACGACTT CCTTATCCTT ACCTGGTTCA CCATTTAACC TACGCCGGGG AT C ACGTAGT 16 8 0 

TCACACAAGT ACACAATACG AAATGGGCGT GGACGTTTTG GTATACCAGG TAGCGATCGC 174 0 

I AAGCCATTGG TACTGCAAAC ATATCAGGAT GCCCAGCAGC ATTTGCCCTA TGCCGATGAC 18 00 

^TCGAATGCCG TAACACCAAT GTCCGAAGAG AATGGTGCCA TTATAGTACC AGC CT ACT AT 18 60 

TGTAATTTAG GTT CTAGAC A TTCTTCATAT ACCTCGCATC AATCAAGAAT CTCGTATACA 192 0 

f TCACATGGTG ATTTATTGGG TGGCATGGCG GCCATGGGTG CCAGCACAAT GAC CAAAGAG 198 0 

AGCAAATTGC GCAGTCGCAA CACACG CAAT CAATCAATCG GTGCTGCAAC CAATGGTGGC 2 04 0 

AGTAGTACGG CTGGTGGTGG CTATCCCGAT GCCAATCACA AGGAACAAAG GGATTATGAA 210 0 

tfJATGGGTCAGG ATTATACAGA CGAAGCTGGC AAAATAAAAC ACCACGACAA TCCTTTTATC 216 0 

GAGCCCGTCC AAACTCAAAC AGTGGTAGAC ATGAAAGATG TTATGGTCTT AAATGAT AT C 2 22 0 

ATTGAACAAG CCGCTGGTCG GCATAGTCGT GCTAGTGAAC GAGGTGAGGA CGATGACGAA 2 280 

GATGGTCCCA CATT CAAGGA CATCGCCCTC GAATACATCC TAAAAGGCAT CGAAATCTTT 2 340 

TGTGTATGGG ACTGTTGTTG GGTGTGGTTA AAATTTCAGG AATGGGTGTC CTTTATTGTG 24 0 0 

TTCGATCCAT TCGTGGAGCT CTTCATTACC CTGTGTATTG TGGT CAAT AC GATGTTTATG 2460 

GCCATGGATC ATCACGACAT GAATCCGGAA TTAGAGAAGG TGCTGAAAAG TGGTAACTAT 2 52 0 

TTCTTCACGG CCACTTTTGC AATTGAAGCC AG CATGAAAC TGATGGCCAT GAGCCCGAAG 258 0 

TACTACTTCC AGGAAGGCTG GAACATTTTC GATTTCATTA TTGTGGCCTT GTCTCTGCTG 2 64 0 
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GAATTGGGCC TGGAGGGTGT CCAGGGCCTG TCGGTGTTGA GAAGTTTTCG TTTGCTTCGT 270 0 

GTATT CAAAT TGGCAAAATC ATGGCCCACA CTCAATTTAC TCATTTCGAT TATGGGCCGG 27 6 0 

ACAATGGGTG CATTGGGTAA TCTGACATTT GTACTTTGCA TTATCATCTT CATCTTTGCC 2 820 

GTGATGGGAA TGCAACTTTT CGGAAAGAAC TATATTGACC ACAAGGATCG CTTCAAGGAC 2880 

CATGAATTAC CGCGCTGGAA CTTCACCGAC TTCATGCACA GCTTCATGAT TGTGTTC CGA 294 0 

GTGCTGTGCG GAGAGTGGAT CGAGTCCATG TGGGACTGCA TGTATGTGGG CGATGTCAGC 3000 

TGTATACCCT TCTTCTTGGC CACGGTCGTG ATAGGCAATC TTGTGGTTCT TAATCTTTTC 3 0 60 

TTAGCTTTGC TTTTGTCCAA CTTCGGTTCA TCTAGTTTAT CAGCCCCGAC TGCCGACAAT 312 0 

GATACCAATA AAATAGCAGA GGCCTTCAAT CGTATTGCTC GTTTTAAGAA CTGGGTGAAA 318 0 

CGTAATATTG CCGATTGTTT TAAGTTAATT CGAAATAAAT TGACAAAT C A AATAAGTGAC 3 24 0 

CAACCATCAG AACATGGCGA TAATGAACTG GAGTTGGGTC ATGACGAAAT CATGGG CGAT 3300 

GGCTTGATCA AAAAGGGTAT GAAGGGCGAG ACCCAGCTGG AGGTGGCCAT TGGCGATGGC 3 3 60 

ATGGAGTTCA CGATACATGG CGATATGAAA AACAACAAGC CGAAGAAATC AAAATTCATG 342 0 

AACAACACAA CGATGATTGG AAACTCAATA AACCACCAAG ACAATAGACT GGAACATGAG 34 8 0 

CTAAACCATA GAGGTTTGTC CATACAGGAC GATGACACTG C CAG C ATT AA CTCATATGGT 354 0 

AGCCATAAGA ATCGACCATT CAAGGACGAG AGC CACAAGG GCAGCGCCGA GACCAT CGAG 3 600 

GGCGAGGAGA AACGCGACGT CAGCAAAGAG GACCTCGGCC TCGACGAGGA ACTGGACGAG 3 660 

GAGGCCGAGG GCGATGAGGG CCAGCTGGAT GGTGACATTA TCATTCATGC GCAAAACGAC 3720 

GACGAGATAA TCGACGACTA TCCGGCCGAC TGTTTCCCCG ACTCGTACTA CAAGAAGTTT 3 780 

CCGATCTTGG CCGGCGACGA GGACTCGCCG TTCTGGCAAG GATGGGGCAA TTTACGACTG 384 0 

AAAACTTTTC AATTAATTGA AAATAAATAT TTTGAAACCG CAGTTATCAC TATGATTTTA 3 900 

ATGAGTAGCT TAGCTTTGGC CTTAGAAGAT GTTCATTTAC CCGATCGACC TGTCATGCAG 3 96 0 

GATATACTGT ACTACATGGA CAGGATATTT ACGGTGATAT TCTTTTTGGA GATGTTGATC 4 02 0 

AAATGGTTGG CCCTGGGCTT TAAGGTTTAC TTCACCAATG CCTGGTGTTG GCTGGATTTC 40 8 0 
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GTGATTGTCA TGCTATCGCT TATAAATTTG GTTGCCGTTT GGTCGGGCTT AAATGATATA 414 0 

GCCGTGTTTA GATCAATGCG CACACTGCGC GCCCTAAGGC CATTGCGTGC TGTCTCTAGA 42 0 0 

TGGGAGGGTA TGAAAGTTGT CGTGAATGCG CTGGTTCAAG CTATACCGTC CATCTTCAAT 42 6 0 

GTGCTATTGG TGTGTCTGAT ATTTTGGCTT ATTTTTGCCA TTATGGGAGT ACAGCTTTTT 43 2 0 

GCTGGAAAAT ATTTTAAGTG TAAAGATGGT AATGACACTG TGCTGAGCCA TGAAATCATA 43 8 0 

CCGAATCGTA ATGCCTGCAA AAGTGAAAAC TACACCTGGG AAAATTCGGC AATGAACTTC 4440 

GATCATGTAG GTAATGCGTA TCTCTGTCTA TTTCAAGTGG CCACCTTTAA GGGCTGGATC 4500 

CAGATTATGA ACGATGCCAT TGATTCACGA GAGGTGGACA AGCAGCCGAT CCGAGAAACC 456 0 

AATATCTACA TGTATTTATA TTTCGTATTC TTCATTATAT TTGGATCATT TTTCACACTC 462 0 

-=f-AATCTGTTCA TTGGTGTTAT CATTGATAAT TTTAATGAAC AAAAGAAGAA AGCTGGTGGA 468 0 

"TCATTAGAAA TGTTCATGAC AGAAGAT C AG AAAAAGTACT ATAATGCTAT GAAAAAGATG 4 74 0 

i. GGCTCTAAAA AACCATTAAA AGCCATTCCA AGAC CGAGGT GGCGACCACA AGCAATAGTA 48 00 

LTTCGAAATAG TTACAGATAA AAAATTCGAT ATAATCATTA TGTTGTTCAT TGGCTTAAAC 4860 

ilATGTTTACCA TGACCCTCGA TCGGTACGAC GCCTCCGAGG CGTACAACAA TGTCCTCGAC 4 92 0 

:AAACTCAATG GGATATTCGT AGTTATTTTC AGTGGCGAAT GTCTATTAAA AATATTCGCT 4 98 0 

: ;TTACGATATC ACTATTTCAA AGAGCCATGG AATTTATTTG ATGTAGTAGT TGTCATTTTA 5 04 0 

•TCCATCTTAG GTCTTGTACT CAGCGACATC ATTGAGAAGT ATTTCGTATC GCCGACACTG 510 0 

CTCCGTGTGG TGAGAGTGGC CAAAGTGGGT CGTGTCCTGC GTTTAGTCAA GGGTGCCAAG 516 0 

GGTATCCGGA CGTTGCTGTT CGCGTTAGCC ATGTCGTTGC CTGCCTTATT CAACATTTGT 522 0 

CTGTTGCTGT TCTTGGTGAT GTTCATCTTT GCTATCTTTG GCATGTCCTT CTTCATGCAT 52 8 0 

GTCAAAGAGA AGAGCGGCAT AAATGCTGTG TATAATTTTA AGACATTTGG CCAAAGTATG 5340 

ATATTGCTGT TTCAGATGTC TACCTCAGCC GGTTGGGATG GTGTGTTAGA TGCCATTATC 54 0 0 

AATGAGGAAG ATTGCGATCC ACCCGACAAC GACAAGGGCT ATCCGGGCAA TTGTGGTTCA 54 6 0 

GCGACTGTTG GAATTACGTT TCTCCTTTCA TATCTAGTTA TAAGCTTTTT GATAGTTATT 5 52 0 
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AATATGTACA 


TTGCTGTCAT 


TCTCGAGAAC 


TATAGCCAGG 


CTACGGAGGA 


TGTACAGGAG 


5580 


GGTCTCACCG 


ACGACGATTA 


CGATATGTAC 


TACGAGATTT 


GGCAACAATT 


CGATCCGGAG 


5640 


GGCACCCAGT 


ACATACGCTA 


CGACCAGCTG 


TCCGAGTTTC 


TGGACGTGCT 


GGAGCCGCCG 


5700 


CTGCAGATCC 


ACAAGCCGAA 


CAAGTACAAA 


ATCATATCGA 


TGGACATGCC 


GATATGTCGG 


5760 


GGCGACATGA 


TGTACTGTGT 


GGATATATTG 


GATGCCCTGA 


CCAAGGACTT 


CTTTGCGCGC 


5820 


AAGGGTAATC 


CGATCGAGGA 


GACGGGTGAA 


ATTGGTGAGA 


TAGCGGCGCG 


ACCGGACACC 


5880 


GAGGGCTATG 


ATCCGGTGTC 


GTCAACACTG 


TGGCGCCAGC 


GTGAGGAGTA 


CTGCGCCAAG 


5940 


CTGATACAGA 


ATGCGTGGCG 


GCGTTACAAG 


AATGGCCCAC 


CCCAGGAGGG 


TGATGAGGGC 


6000 


GAGGCGGCTG 


GTGGCGAAGA 


TGGTGCTGAA 


GGCGGTGAGG 


GTGAAGGAGG 


CAGCGGCGGC 


6060 


GGCGGCGGTG 


ATGATGGTGG 


CTCAGCGACA 


GGAGCAACGG 


CGGCGGCGGG 


AGCCACATCA 


6120 


- CCCTCAGATC 


CAGATGCCGG 


CGAAGCAGAT 


GGTGCCAGCG 


TCGGCGGCCC 


CCTTAGTCCG 


6180 


: GGCTGTGTTA 


GTGGCGGCAG 


TAATGGCCGC 


CAAACGGCCG 


TACTGGTCGA 


AAGCGATGGT 


6240 


y. TTTGTTACAA 


AAAACGGTCA 


TAAGGTTGTA 


ATACACTCGA 


GATCGCCGAG 


C AT AAC AT C C 


6300 


AGGACGGCAG 


ATGTCTGA 










6318 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
ATGACAGAAG ATTCCGACTC GATATCTGAG GAAGAACGCA GTTTGTTCCG TCCCTTCACC 6 0 

CGCGAATCAT TGTTACAAAT CGAACAACGT ATCGCTGAAC ATGAAAAACA AAAGGAGCTG 12 0 
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GAAAGAAAGA GAGCCGCCGA AGGAGAGCAG ATACGATATG ATGACGAGGA CGAAGATGAA 18 0 

GGTCCACAGC CGGATCCCAC ACTTGAACAG GGTGTGCCTA TACCTGTTCG AATGCAGGGC 24 0 

AGCTTCCCGC CGGAATTGGC CTCCACTCCT CTCGAGGATA TCGATCCCTT CTACAGTAAT 3 00 

GTACTGACAT TTGTAGTAAT AAGTAAAGGA AAGGATATTT TTCGTTTTTC TGCCTCAAAA 3 60 

GCAATGTGGC TGCTCGATCC ATTCAATCCG ATACGTCGTG TAGCCATTTA TATTTTAGTG 42 0 

CATCCCTTGT TTTCGTTATT CATTATCACC ACTATTCTAA CTAATTGTAT TTTAATGATA 4 80 

ATGCCGACAA CGCCCACGGT CGAATCCACA GAGGTGATAT TCACCGGAAT CTACACATTT 54 0 

GAATCAGCTG TTAAAGTGAT GGCACGAGGT TTCATTTTAT GCCCGTTTAC GTATCTTAGA 60 0 

^.GATGCATGGA ATTGGCTGGA CTTCGTAGTA ATAGCTTTAG CTTATGTGAC CATGGGCATA 660 

GATTTAGGTA ATCTCGCAGC TTTGAGAACA TTTAGGGTAC TGCGAGCTCT GAAAACCGTA 72 0 

'■_ ' GCCATTGTGC CAGGTCTAAA AACCATTGTC GGTGCTGTCA TTGAATCTGT AAAAAATCTA 78 0 

: - : CGCGATGTGA TAATTTTGAC AATGTTTTCC CTGTCGGTGT TCGCGCTGAT GGGCCTACAA 84 0 

r ATCTATATGG GTGTTCTAAC ACAAAAGTGC ATTAAACGAT TCCCCCTGGA CGGCAGTTGG 90 0 

hGGCAATCTGA CCGATGAAAA CTGGTTTCTA CACAATAGCA ACAGTTCCAA TTGGTTTACG 96 0 

GAGAACGATG GCGAGTCATA TCCGGTGTGC GGGAATGTAT CCGGTGCGGG ACAATGCGGC 102 0 

GAAGATTACG TCTGCCTGCA GGGCTTCGGC CCCAATCCCA ACTACGACTA CACCAGTTTC 10 8 0 

GACTCATTCG GTTGGGCTTT CCTGTCGGCG TTTCGTCTCA TGACCCAAGA TTTCTGGGAG 114 0 

GATCTGTATC AGCACGTGCT GCAAGCAGCT GGACCCTGGC ACATGTTGTT CTTTATAGTC 12 0 0 

ATCATCTTCC TAGGTTCATT CTATCTTGTG AATTTGATTT TGGCCATTGT TGCCATGTCT 12 6 0 

TATGACGAAT TGCAAAAGAA GGCCGAAGAA GAAGAGGCTG CCGAGGAGGA GGCGATCCGA 13 2 0 

GAAGCTGAAG AAGCGGCAGC AGCCAAGGCG GCCAAACTGG AGGAGCGGGC CAATGTAGCA 13 8 0 

GCTCAAGCGG CTCAGGATGC AGCGGATGCC GCTGCGGCAG CTCTGCATCC CGAGATGGCA 144 0 

AAGAGTCCCA CGTACTCTTG CATTAGCTAT GAACTGTTTG TTGGCGGCGA GAAGGGCAAC 15 0 0 

GATGACAACA ACAAGGAGAA GATGTCGATA CGCAGCGTCG AAGTGGAATC GGAGTCGGTG 15 6 0 
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AGCGTTATAC 


AAAGACAACC 


AGCACCTACC 


ACAGCACCCG 


CTACTAAAGT 


CCGTAAAGTT 


1620 


AGCACGACTT 


CCTTATCCTT 


ACCTGGTTCA 


CCATTTAACC 


TACGC CGGGG 


ATCACGTAGT 


1680 


TCACACAAGT 


ACACAATACG 


AAATGGGCGT 


GGACGTTTTG 


GTATACCAGG 


TAGCGATCGC 


1740 


AAGCCATTGG 


TACTGCAAAC 


ATATCAGGAT 


GCCCAGCAGC 


ATTTGCCCTA 


TGCCGATGAC 


1800 


TCGAATGCCG 


TAACACCAAT 


GTCCGAAGAG 


AATGGTGCCA 


TTATAGTACC 


AGCCTACTAT 


1860 


TGTAATTTAG 


GTT CTAGACA 


TTCTTCATAT 


ACCTCGCATC 


AATCAAGAAT 


CTCGTATACA 


1920 


TCACATGGTG 


ATTTATTGGG 


TGGCATGGCG 


GCCATGGGTG 


CCAGCACAAT 


GAC CAAAG AG 


1980 


AGCAAATTGC 


GCAGTCGCAA 


CACACGCAAT 


CAATCAATCG 


GTGCTGCAAC 


CAATGGTGGC 


2040 


AGTAGTACGG 


CCGGTGGTGG 


CTATCCCGAT 


GCCAATCACA 


AGGAACAAAG 


GGATTATGAA 


2100 


ATGGGTCAGG 


ATTATACAGA 


CGAAGCTGGC 


AAAATAAAAC 


ACCACGACAA 


TCCTTTTATC 


2160 


- GAGCCCGTCC 


AAACTCAAAC 


AGTGGTAGAC 


ATGAAAGATG 


TTATGGTCTT 


AAATGATATC 


2220 


yVTTGAACAAG 


CCGCTGGTCG 


GCATAGTCGT 


GCTAGTGAAC 


GAGGTGAGGA 


CGATGACGAA 


2280 


yGATGGTCCCA 


CATTCAAGGA 


CATCGCCCTC 


GAAT AT AT C C 


TAAAAGGCAT 


CGAAATCTTT 


2340 


TGTGTATGGG 


ACTGTTGTTG 


GGTGTGGTTA 


AAATTTCAGG 


AATGGGTCTC 


CTTTATTGTG 


2400 


JrtTTCGATCCAT 


TCGTGGAGCT 


CTTCATTACC 


CTGTGTATTG 


TGGT CAAT AC 


AATGTTCATG 


2460 


"GCCATGGATC 


AT C ACG AC AT 


GAATCCGGAA 


TTGGAGAAGG 


TGCTGAAAAG 


TGGTAACTAT 


2520 


^-TTCTTCACGG 


CCACTTTTGC 


AATTGAGGCC 


AGCATGAAAC 


TGATGGCCAT 


GAGCCCGAAG 


2580 


TACTACTTCC 


AGGAAGGCTG 


GAACATTTTC 


GATTTCATTA 


TTGTGGCCTT 


GTCTCTGCTG 


2640 


GAATTGGGCC 


' TGGAGGGTGT 


CCAGGGCCTG 


TCGGTGTTGA 


GAAGTTTTCG 


TTTGCTTCGT 


2700 


GTATT C AAAT 


TGGCAAAATC 


ATGGCCCACA 


CTGAATTTAC 


TCATTTCGAT 


TATGGGCCGG 


2760 


ACAATGGGTG 


CATTGGGTAA 


TCTGACATTT 


GTACTTTGCA 


TTATCATCTT 


CATCTTTGCC 


2820 


GTGATGGGAA 


TGCAACTTTT 


CGGAAAGAAC 


TATATTGACC 


AC AAGGAT CG 


CTTCAAGGAC 


2880 


CATGAATTAC 


CGCGCTGGAA 


TTTCACCGAC 


TTCATGCACA 


GCTTCATGAT 


TGTGTTCCGA 


2940 


GTGCTGTGCG 


GAGAGTGGAT 


CGAGTCCATG 


TGGGACTGCA 


TGTATGTGGG 


CGATGTCAGC 


3000 
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TGTATACCCT 


TCTTCTTGGC 


CACGGTCGTG 


ATCGGCAATT 


TTGTGGTTCT 


TAATCTTTTC 


3060 


TTAGCTTTGC 


TTTTGTCCAA 


CTTCGGTTCA 


TCTAGTTTAT 


CAGCCCCGAC 


TGCCGACAAT 


3120 


GATACCAATA 


AAATAGCAGA 


GGCCTTCAAT 


CGTATTGCTC 


GTTTTAAGAA 


CTGGGTGAAA 


3180 


CGTAATATTG 


CCGATTGTTT 


TAAGTTAATT 


CGAAATAAAT 


TGACAAATCA 


AATAAGTGAC 


3240 


CAACCATCAG 


AACATGGCGA 


TAATGAACTG 


GAGTTGGGTC 


ATGACGAAAT 


CATGGGCGAT 


3300 


GGCTTGATCA 


AAAAGGGTAT 


GAAGGGCGAG 


ACCCAGCTGG 


AGGTGGCCAT 


TGGCGATGGC 


3360 


ATGGAGTTCA 


CGATACATGG 


CGATATGAAA 


AACAACAAGC 


CCAAGAAATC 


AAAATT CATA 


3420 


AACAACACAA 


CGATGATTGG 


AAACTCAATA 


AACCAC CAAG 


ACAATAGACT 


GGAACATGAG 


3480 


•XTAAACCATA 


GAGGTTTGTC 


CATACAGGAC 


GATGACACTG 


CCAGCATTAA 


CTCATATGGT 


3540 


4;agccataaga 


ATCGACCATT 


CAAGGACGAG 


AGCCACAAGG 


GCAGCGCCGA 


G AC CAT CGAG 


3600 


0§GGCGAGGAGA 


AACGCGACGT 


CAGCAAAGAG 


GACCTCGGCC 


TCGACGAGGA 


ACTGGACGAG 


3660 


"gaggccgagg 


GCGATGAGGG 


CCAGCTGGAT 


GGTGAC AT C A 


TCATTCATGC 


CCAAAACGAC 


3720 


I gacgagataa 


TCGACGACTA 


TCCGGCCGAC 


TGTTTCCCCG 


ACTCGTACTA 


CAAGAAGTTT 


3780 


!--ccgatcttgg 


CCGGCGACGA 


GGACTCGCCG 


TTCTGGCAAG 


GATGGGGCAA 


TTTACGACTG 


3840 


aaaactttto- 


AATTAATTGA 


AAATAAATAT 


TTTGAAACCG 


CAGTTATCAC 


TATGATTTTA 


3900 


* l atgagtagct 


TAGCTTTGGC 


CTTAGAAGAT 


GTTCATTTAC 


CCGATCGACC 


TGTCATGCAG 


3960 


GATATACTGT 


ACTACATGGA 


CAGGATATTT 


ACGGTGATAT 


TCTTTTTGGA 


GATGTTGATC 


4020 


AAATGGTTGG 


CCCTGGGCTT 


TAAGGT CTAC 


TTCACCAATG 


CCTGGTGTTG 


GCTGGATTTC 


4080 


GTGATTGT C A 


TGCTATCGCT 


TATAAATTTG 


GTTGCCGTTT 


GGTCGGGCTT 


AAATGATATA 


4140 


GCCGTGTTTA 


GAT CAATGCG 


CACACTGCGC 


GCCCTAAGGC 


CATTGCGTGC 


TGTCTCTAGA 


4200 


TGGGAGGGTA 


TGAAAGTTGT 


CGTGAATGCG 


CTGGTTCAAG 


CTATACCGTC 


CATCTTCAAT 


4260 


GTGCTATTGG 


TGTGTCTGAT 


ATTTTGGCTT 


ATTTTTGCCA 


TTATGGGAGT 


ACAGCTTTTT 


4320 


GCTGGAAAAT 


ATTTTAAGTG 


TAAAGATGGT 


AATGACACTG 


TGCTGAGCCA 


TGAAAT CATA 


4380 


CCGAATCGTA 


ATGCCTGCAA 


AAGTGAAAAC 


TACACCTGGG 


AAAATTCGGC 


AATGAACTTC 


4440 
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GATCATGTAG 


GTAATGCGTA 


TCTCTGTCTA 


TTTCAAGTGG 


CCACCTTTAA 


GGGCTGGATC 


4500 


CAGATTATGA 


ACGATGCCAT 


TGATT CACGA 


GAGGTGGACA 


AGCAGCCGAT 


CCGAGAAACC 


4560 


AATATCTACA 


TGTATTTATA 


TTTCGTATTC 


TTCATTATAT 


TTGGAT CATT 


TTTCACACTC 


4620 


AATCTGTTCA 


TTGGTGTTAT 


CATTGATAAT 


TTTAATGAAC 


AAAAGAAGAA 


AG C AGGTGGA 


4680 


TCATTAGAAA 


TGTTCATGAC 


AGAAGATCAG 


AAAAAGTACT 


ATAATGCTAT 


GAAAAAGATG 


4740 


GGCTCTAAAA 


AAC CATT AAA 


AGCCATTCCA 


AGACCGAGGT 


GGCGACCACA 


AG C AAT AST A 


4800 


TTCGAAATAG 


TTACAGATAA 


AAAATT CGAT 


AT AAT CATT A 


TGTTGTT CAT 


TGGCTTAAAC 


4860 


ATGTTTACCA 


TGACCCTCGA 


TCGGTACGAC 


GCCTCCGAGG 


CGTACAACAA 


TGTCCTCGAC 


4920 


Jaaactcaatg 


GGATATT CGT 


AGTTATTTTC 


AGTGGCGAAT 


GT C TATT AAA 


AATATTCGCT 


4980 


Tttacgatatc 


ACTATTTCAA 


AGAGCCATGG 


AATTTATTTG 


ATGTAGTAGT 


TGTCATTTTA 


5040 


" TCCATCTTAG 


GTCTTGTACT 


CAGCGACATC 


ATTGAGAAGT 


ATTTCGTATC 


GCCGACACTG 


5100 


CTCCGTGTGG 


TGAGAGTGGC 


CAAAGTGGGT 


CGTGTCCTGC 


GTTTAGTCAA 


GGGTGCCAAG 


5160 


" GGTATCCGGA 


CGTTGCTGTT 


CGCGTTAGCC 


ATGTCGTTGC 


CTGCCTTATT 


CAACATTTGT 


5220 


~ CTGTTGCTGT 


TCTTGGTGAT 


GTTCATCTTT 


GCTATCTTTG 


GCATGTCCTT 


CTTCATGCAT 


5280 


GTCAAAGAGA 


AGAGCGGCAT 


AAATGCTGTG 


TATAATTTTA 


AGACATTTGG 


CCAAAGTATG 


5340 


yOATATTGCTGT 


TTCAGATGTC 


TACCTCAGCC 


GGTTGGGATG 


GTGTGTTAGA 


TGCCATTATC 


5400 


AATGAGGAAG 


ATTGCGATCC 


ACCCGACAAC 


GACAAGGGCT 


ATCCGGGCAA 


TTGTGGTTCA 


5460 


GCGACTGTTG 


GAATTACGTT 


TCTCCTTTCA 


TAT CTAGTT A 


TAAGCTTTTT 


GATAGTTATT 


5520 


AATATGTACA 


TTGCTGTCAT 


TCTCGAGAAC 


TATAGCCAGG 


CTACGGAGGA 


TGTACAGGAG 


5580 


GGTCTCACCG 


ACGACGACTA 


TGATATGTAC 


TACGAGATTT 


GGCAACAATT 


CGATCCGGAG 


5640 


GGTACCCAGT 


ACATAAGATA 


CGACCAGCTG 


TCCGAGTTCC 


TGGACGTGCT 


GGAGCCGCCG 


5700 


CTGCAGATCC 


ACAAGCCGAA 


CAAGTACAAA 


ATCATATCGA 


TGGACATGCC 


GATATGTCGG 


5760 


GGCGACATGA 


TGTACTGTGT 


GGATATATTG 


GATGCCCTGA 


CCAAGGACTT 


CTTTGCGCGC 


5820 


AAGGGT AAT C 


CGATCGAGGA 


GACGGGTGAA 


ATTGGTGAGA 


TTGCGGCGCG 


ACCGGACACC 


5880 
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GAGGGCTATG 


ATCCGGTGTC 


GTCGACACTG 


TGGCGCCAGC 


GTGAGGAGTA 


CTGCGCCAAG 


5940 


CTGATACAGA 


ATGCGTGGCG 


GCGTTACAAG 


AATGGCCCAC 


CCCAGGAGGG 


TGATGAGGGC 


6000 


GAGGCGGCTG 


GTGGCGAAGA 


TGGTGCTGAA 


GGCGGTGAGG 


GTGAAGGCGG 


CAGCGGCGGC 


6060 


GGCGGCGATG 


ATGATGGTGG 


CTCAGCGACG 


GCGGCGGGAG 


CCACATCACC 


CACAGATCCA 


6120 


GATGCCGGCG 


AAGCAGATGG 


TGCCAGCGCC 


GGCAATGGTG 


GCGGCCCCCT 


TAGTCCGGGC 


6180 


TGTGTTAGTG 


GCGGCAGTAA 


TGGCCGCCAA 


ACGGCCGTAC 


TGGT CGAAAG 


CGATGGTTTT 


6240 


GTTACAAAAA 


ACGGTCATAA 


GGTTGTAATA 


CACTCGAGAT 


CGCCGAGCAT 


AACATCCAGG 


6300 


ACGGCAGATG 


TCTGA 










6315 



=-(2) INFORMATION FOR SEQ ID NO : 3 : 

:. (i) SEQUENCE CHARACTERISTICS: 

T: (A) LENGTH: 2105 amino acids 

14; (B) TYPE: amino acid 

: - (C) STRANDEDNESS : not relevant 
(D) TOPOLOGY: linear 

r " (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Thr Glu Asp Ser Asp Ser lie Ser Glu Glu Glu Arg Ser Leu Phe 
15 10 15 

Arg Pro Phe Thr Arg Glu Ser Leu Leu Gin lie Glu Gin Arg lie Ala 
20 25 30 

Glu His Glu Lys Gin Lys Glu Leu Glu Arg Lys Arg Ala Ala Glu Gly 
35 40 45 

Glu Gin lie Arg Tyr Asp Asp Glu Asp Glu Asp Glu Gly Pro Gin Pro 
50 55 60 

Asp Pro Thr Leu Glu Gin Gly Val Pro He Pro Val Arg Met Gin Gly 
65 70 75 80 

Ser Phe Pro Pro Glu Leu Ala Ser Thr Pro Leu Glu Asp He Asp Pro 
85 90 95 
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Phe Tyr Ser Asn Val Leu Thr Phe Val Val lie Ser Lys Gly Lys Asp 
100 105 110 

lie Phe Arg Phe Ser Ala Ser Lys Ala Met Trp Leu Leu Asp Pro Phe 
115 120 125 

Asn Pro lie Arg Arg Val Ala lie Tyr lie Leu Val His Pro Leu Phe 
130 135 140 

Ser Leu Phe lie lie Thr Thr lie Leu Thr Asn Cys lie Leu Met lie 
i45 150 155 160 

Met Pro Thr Thr Pro Thr Val Glu Ser Thr Glu Val lie Phe Thr Gly 
165 170 175 

lie Tyr Thr Phe Glu Ser Ala Val Lys Val Met Ala Arg Gly Phe lie 
180 185 190 

Leu Cys Pro Phe Thr Tyr Leu Arg Asp Ala Trp Asn Trp Leu Asp Phe 
195 200 205 

Val Val lie Ala Leu Ala Tyr Val Thr Met Gly lie Asp Leu Gly Asn 
210 215 220 

Leu Ala Ala Leu Arg Thr Phe Arg Val Leu Arg Ala Leu Lys Thr Val 
225 230 235 240 

Ala lie Val Pro Gly Leu Lys Thr lie Val Gly Ala Val lie Glu Ser 
245 250 255 

Val Lys Asn Leu Arg Asp Val lie lie Leu Thr Met Phe Ser Leu Ser 
260 265 270 

Val Phe Ala Leu Met Gly Leu Gin lie Tyr Met Gly Val Leu Thr Gin 
275 280 285 

Lys Cys lie Lys Arg Phe Pro Leu Asp Gly Ser Trp Gly Asn Leu Thr 
290 295 300 

Asp Glu Asn Trp Phe Leu His Asn Ser Asn Ser Ser Asn Trp Phe Thr 
305 310 315 320 

Glu Asn Asp Gly Glu Ser Tyr Pro Val Cys Gly Asn Val Ser Gly Ala 
325 330 335 

Gly Gin Cys Gly Glu Asp Tyr Val Cys Leu Gin Gly Phe Gly Pro Asn 
340 345 350 



Pro Asn Tyr Asp Tyr Thr Ser Phe Asp Ser Phe Gly Trp Ala Phe Leu 
355 360 365 

Ser Ala Phe Arg Leu Met Thr Gin Asp Phe Trp Glu Asp Leu Tyr Gin 
370 375 380 

His Val Leu Gin Ala Ala Gly Pro Trp His Met Leu Phe Phe lie Val 
385 390 395 400 

lie lie Phe Leu Gly Ser Phe Tyr Leu Val Asn Leu lie Leu Ala lie 
405 410 415 

Val Ala Met Ser Tyr Asp Glu Leu Gin Lys Lys Ala Glu Glu Glu Glu 
420 425 430 

Ala Ala Glu Glu Glu Ala lie Arg Glu Ala Glu Glu Ala Ala Ala Ala 
435 440 445 

Lys Ala Ala Lys Leu Glu Glu Arg Ala Asn Val Ala Ala Gin Ala Ala 
450 455 460 

Gin Asp Ala Ala Asp Ala Ala Ala Ala Ala Leu His Pro Glu Met Ala 
465 470 475 480 

Lys Ser Pro Thr Tyr Ser Cys lie Ser Tyr Glu Leu Phe Val Gly Gly 
485 490 495 

Glu Lys Gly Asn Asp Asp Asn Asn Lys Glu Lys Met Ser lie Arg Ser 
500 505 510 

Val Glu Val Glu Ser Glu Ser Val Ser Val He Gin Arg Gin Pro Ala 
515 520 525 

Pro Thr Thr Ala Pro Ala Thr Lys Val Arg Lys Val Ser Thr Thr Ser 
530 535 540 

Leu Ser Leu Pro Gly Ser Pro Phe Asn Leu Arg Arg Gly Ser Arg Ser 
545 550 555 560 

Ser His Lys Tyr Thr He Arg Asn Gly Arg Gly Arg Phe Gly lie Pro 
565 570 575 

Gly Ser Asp Arg Lys Pro Leu Val Leu Gin Thr Tyr Gin Asp Ala Gin 
580 585 590 

Gin His Leu Pro Tyr Ala Asp Asp Ser Asn Ala Val Thr Pro Met Ser 
595 600 605 ■ 
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Glu Glu Asn Gly Ala lie lie Val Pro Ala Tyr Tyr Cys Asn Leu Gly 
610 615 620 

Ser Arg His Ser Ser Tyr Thr Ser His Gin Ser Arg lie Ser Tyr Thr 
625 630 635 640 

Ser His Gly Asp Leu Leu Gly Gly Met Ala Ala Met Gly Ala Ser Thr 
645 650 655 

Met Thr Lys Glu Ser Lys Leu Arg Ser Arg Asn Thr Arg Asn Gin Ser 
660 665 670 

lie Gly Ala Ala Thr Asn Gly Gly Ser Ser Thr Ala Gly Gly Gly Tyr 
675 680 685 

Pro Asp Ala Asn His Lys Glu Gin Arg Asp Tyr Glu Met Gly Gin Asp 
690 695 700 

Tyr Thr Asp Glu Ala Gly Lys lie Lys His His Asp Asn Pro Phe lie 
705 710 715 720 

Glu Pro Val Gin Thr Gin Thr Val Val Asp Met Lys Asp Val Met Val 
725 730 735 

Leu Asn Asp lie lie Glu Gin Ala Ala Gly Arg His Ser Arg Ala Ser 
740 745 750 

Glu Arg Gly Glu Asp Asp Asp Glu Asp Gly Pro Thr Phe Lys Asp lie 
755 760 765 

Ala Leu Glu Tyr lie Leu Lys Gly lie Glu lie Phe Cys Val Trp Asp 
770 775 780 

Cys Cys Trp Val Trp Leu Lys Phe Gin Glu Trp Val Ser Phe lie Val 
785 790 795 800 

Phe Asp Pro Phe Val Glu Leu Phe lie Thr Leu Cys lie Val Val Asn 
805 810 815 

Thr Met Phe Met Ala Met Asp His His Asp Met Asn Pro Glu Leu Glu 
820 825 830 

Lys Val Leu Lys Ser Gly Asn Tyr Phe Phe Thr Ala Thr Phe Ala lie 
835 840 845 

Glu Ala Ser Met Lys Leu Met Ala Met Ser Pro Lys Tyr Tyr Phe Gin 
850 855 860 



Glu Gly Trp Asn 
865 

Glu Leu Gly Leu 



Arg Leu Leu Arg 
900 

Leu Leu lie Ser 
915 

Thr Phe Val Leu 
930 

Gin Leu Phe Gly 
945 

His Glu Leu Pro 



5 



lie Phe Asp Phe 
870 

Glu Gly Val Gin 
885 

Val Phe Lys Leu 



lie Met Gly Arg 
920 

Cys lie lie lie 
935 

Lys Asn Tyr lie 
950 

Arg Trp Asn Phe 
965 



He He Val Ala 
875 

Gly Leu Ser Val 
890 

Ala Lys Ser Trp 
905 

Thr Met Gly Ala 



Phe He Phe Ala 
940 

Asp His Lys Asp 
955 

Thr Asp Phe Met 
970 



Leu Ser Leu Leu 
880 

Leu Arg Ser Phe 
895 

Pro Thr Leu Asn 
910 

Leu Gly Asn Leu 
925 

Val Met Gly Met 



Arg Phe Lys Asp 
960 

His Ser Phe Met 
975 



He Val Phe Arg Val Leu Cys Gly Glu Trp He Glu Ser Met Trp Asp 
980 985 990 

Cys Met Tyr Val Gly Asp Val Ser Cys He Pro Phe Phe Leu Ala Thr 
995 1000 1005 

Val Val He Gly Asn Leu Val Val Leu Asn Leu Phe Leu Ala Leu Leu 
1010 1015 1020 



Leu Ser Asn Phe Gly Ser Ser Ser 
1025 1030 

Asp Thr Asn Lys He Ala Glu Ala 
1045 

Asn Trp Val Lys Arg Asn He Ala 
1060 



Leu Ser Ala Pro Thr Ala Asp Asn 
1035 1040 

Phe Asn Arg He Ala Arg Phe Lys 
1050 1055 

Asp Cys Phe Lys Leu He Arg Asn 
1065 1070 



Lys Leu Thr Asn Gin He Ser Asp Gin Pro Ser Glu His Gly Asp Asn 
1075 1080 1085 

Glu Leu Glu Leu Gly His Asp Glu He Met Gly Asp Gly Leu He Lys 

1090 1095 1100 



Lys Gly Met Lys Gly Glu Thr Gin Leu Glu Val Ala He Gly Asp Gly 
1105 1110 1115 1120 



Met Glu Phe Thr lie His Gly Asp Met Lys Asn Asn Lys Pro Lys Lys 
1125 1130 1135 

Ser Lys Phe Met Asn Asn Thr Thr Met lie Gly Asn Ser lie Asn His 
1140 1145 1150 

Gin Asp Asn Arg Leu Glu His Glu Leu Asn His Arg Gly Leu Ser lie 
1155 1160 1165 

Gin Asp Asp Asp Thr Ala Ser lie Asn Ser Tyr Gly Ser His Lys Asn 
1170 1175 1180 

Arg Pro Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr lie Glu 
1185 1190 1195 1200 

Gly Glu Glu Lys Arg Asp Val Ser Lys Glu Asp Leu Gly Leu Asp Glu 
1205 1210 1215 

Glu Leu Asp Glu Glu Ala Glu Gly Asp Glu Gly Gin Leu Asp Gly Asp 
1220 1225 1230 

lie lie lie His Ala Gin Asn Asp Asp Glu lie lie Asp Asp Tyr Pro 
1235 1240 1245 

Ala Asp Cys Phe Pro Asp Ser Tyr Tyr Lys Lys Phe Pro lie Leu Ala 
1250 1255 1260 

Gly Asp Glu Asp Ser Pro Phe Trp Gin Gly Trp Gly Asn Leu Arg Leu 
1265 1270 1275 1280 

Lys Thr Phe Gin Leu lie Glu Asn Lys Tyr Phe Glu Thr Ala Val lie 
1285 1290 1295 

Thr Met lie Leu Met Ser Ser Leu Ala Leu Ala Leu Glu Asp Val His 
1300 1305 1310 

Leu Pro Asp Arg Pro Val Met Gin Asp lie Leu Tyr Tyr Met Asp Arg 
1315 1320 1325 

lie Phe Thr Val lie Phe Phe Leu Glu Met Leu lie Lys Trp Leu Ala 
1330 1335 1340 

Leu Gly Phe Lys Val Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp Phe 
1345 1350 1355 1360 

Val lie Val Met Leu Ser Leu lie Asn Leu Val Ala Val Trp Ser Gly 
1365 1370 1375 



Leu Asn Asp lie Ala Val Phe Arg Ser Met Arg Thr Leu Arg Ala Leu 
1380 1385 1390 

Arg Pro Leu Arg Ala Val Ser Arg Trp Glu Gly Met Lys Val Val Val 
1395 1400 1405 

Asn Ala Leu Val Gin Ala lie Pro Ser lie Phe Asn Val Leu Leu Val 
1410 1415 1420 

Cys Leu lie Phe Trp Leu lie Phe Ala lie Met Gly Val Gin Leu Phe 
1425 1430 1435 1440 

Ala Gly Lys Tyr Phe Lys Cys Lys Asp Gly Asn Asp Thr Val Leu Ser 
1445 1450 1455 

His Glu lie lie Pro Asn Arg Asn Ala Cys Lys Ser Glu Asn Tyr Thr 
1460 1465 1470 

Trp Glu Asn Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu 
1475 1480 1485 

Cys Leu Phe Gin Val Ala Thr Phe Lys Gly Trp lie Gin lie Met Asn 
1490 1495 1500 

Asp Ala lie Asp Ser Arg Glu Val Asp Lys Gin Pro lie Arg Glu Thr 
1505 1510 1515 1520 

Asn lie Tyr Met Tyr Leu Tyr Phe Val Phe Phe lie lie Phe Gly Ser 
1525 1530 1535 

Phe Phe Thr Leu Asn Leu Phe lie Gly Val lie lie Asp Asn Phe Asn 
1540 1545 1550 

Glu Gin Lys Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu 
1555 1560 1565 

Asp Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys Met Gly Ser Lys Lys 
1570 1575 1580 

Pro Leu Lys Ala lie Pro Arg Pro Arg Trp Arg Pro Gin Ala lie Val 
1585 1590 1595 1600 

Phe Glu lie Val Thr Asp Lys Lys Phe Asp -lie lie lie Met Leu Phe 
1605 1610 1615 

lie Gly Leu Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser 
1620 1625 1630 



Glu Ala Tyr Asn Asn Val Leu Asp Lys Leu Asn Gly lie Phe Val Val 
1635 1640 1645 

lie Phe Ser Gly Glu Cys Leu Leu Lys lie Phe Ala Leu Arg Tyr His 
1650 1655 1660 

Tyr Phe Lys Glu Pro Trp Asn Leu Phe Asp Val Val Val Val lie Leu 
1665 1670 1675 1680 

Ser lie Leu Gly Leu Val Leu Ser Asp lie lie Glu Lys Tyr Phe Val 
1685 1690 1635 

Ser Pro Thr Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val 
1700 1705 1710 

Leu Arg Leu Val Lys Gly Ala Lys Gly lie Arg Thr Leu Leu Phe Ala 
1715 1720 1725 

Leu Ala Met Ser Leu Pro Ala Leu Phe Asn lie Cys Leu Leu Leu Phe 
1730 1735 1740 

Leu Val Met Phe lie Phe Ala lie Phe Gly Met Ser Phe Phe Met His 
1745 1750 1755 1760 

Val Lys Glu Lys Ser Gly lie Asn Ala Val Tyr Asn Phe Lys Thr Phe 
1765 1770 1775 

Gly Gin Ser Met lie Leu Leu Phe Gin Met Ser Thr Ser Ala Gly Trp 
1780 1785 1790 

Asp Gly Val Leu Asp Ala lie lie Asn Glu Glu Asp Cys Asp Pro Pro 
1795 1800 1805 

Asp Asn Asp Lys Gly Tyr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly 
1810 1815 1820 



lie Thr Phe Leu Leu Ser Tyr Leu 
1825 1830 

Asn Met Tyr lie Ala Val lie Leu 
1845 

Asp Val Gin Glu Gly Leu Thr Asp 
1860 



Val lie Ser Phe Leu lie Val lie 
1835 1840 

Glu Asn Tyr Ser Gin Ala Thr Glu 
1850 1855 

Asp Asp Tyr Asp Met Tyr Tyr Glu 
1865 1870 



lie Trp Gin Gin Phe Asp Pro Glu Gly Thr Gin Tyr lie Arg Tyr Asp 
1875 1880* 1885 



Gin Leu Ser Glu Phe Leu Asp Val Leu Glu Pro Pro Leu Gin lie His 
1890 1895 1900 

Lys Pro Asn Lys Tyr Lys lie lie Ser Met Asp Met Pro lie Cys Arg 
1905 1910 1915 1920 

Gly Asp Met Met Tyr Cys Val Asp He Leu Asp Ala Leu Thr Lys Asp 
1925 1930 1935 

Phe Phe Ala Arg Lys Gly Asn Pro He Glu Glu Thr Gly Glu He Gly 
1940 1945 1950 

Glu He Ala Ala Arg Pro Asp Thr Glu Gly Tyr Asp Pro Val Ser Ser 
1955 1960 1965 

Thr Leu Trp Arg Gin Arg Glu Glu Tyr Cys Ala Lys Leu He Gin Asn 
1970 1975 1980 

Ala Trp Arg Arg Tyr Lys Asn Gly Pro Pro Gin Glu Gly Asp Glu Gly 
1985 1990 1995 2000 

Glu Ala Ala Gly Gly Glu Asp Gly Ala Glu Gly Gly Glu Gly Glu Gly 
2005 2010 2015 

Gly Ser Gly Gly Gly Gly Gly Asp Asp Gly Gly Ser Ala Thr Gly Ala 
2020 2025 2030 

Thr Ala Ala Ala Gly Ala Thr Ser Pro Ser Asp Pro Asp Ala^Gly Glu 
2035 2040 2045 

Ala Asp Gly Ala Ser Val Gly Gly Pro Leu Ser Pro Gly Cys Val Ser 
2050 2055 2060 

Gly Gly Ser Asn Gly Arg Gin Thr Ala Val Leu Val Glu Ser Asp Gly 
2065 2070 2075 2080 

Phe Val Thr Lys Asn Gly His Lys Val Val He His Ser Arg Ser Pro 
2085 2090 2095 

Ser He Thr Ser Arg Thr Ala Asp Val 
2100 2105 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Thr Glu Asp Ser Asp Ser lie Ser Glu Glu Glu Arg Ser Leu Phe 
15 10 15 

Arg Pro Phe Thr Arg Glu Ser Leu Leu Gin lie Glu Gin Arg lie Ala 
20 25 30 

Glu His Glu Lys Gin Lys Glu Leu Glu Arg Lys Arg Ala Ala Glu Gly 
35 40 45 

Glu Gin lie Arg Tyr Asp Asp Glu Asp Glu Asp Glu Gly Pro Gin Pro 
50 55 60 

Asp Pro Thr Leu Glu Gin Gly Val Pro lie Pro Val Arg Met Gin Gly 
65 70 75 80 

Ser Phe Pro Pro Glu Leu Ala Ser Thr Pro Leu Glu Asp lie Asp Pro 
85 90 95 

Phe Tyr Ser Asn Val Leu Thr Phe Val Val lie Ser Lys Gly Lys Asp 
100 105 110 

lie Phe Arg Phe Ser Ala Ser Lys Ala Met Trp Leu Leu Asp Pro Phe 
115 120 125 

Asn Pro lie Arg Arg Val Ala lie Tyr lie Leu Val His Pro Leu Phe 
130 135 140 

Ser Leu Phe lie lie Thr Thr lie Leu Thr Asn Cys lie Leu Met lie 
145 150 155 160 

Met Pro Thr Thr Pro Thr Val Glu Ser Thr Glu Val lie Phe Thr Gly 
165 170 175 

lie Tyr Thr Phe Glu Ser Ala Val Lys Val Met Ala Arg Gly Phe lie 
180 185 190 

Leu Cys Pro Phe Thr Tyr Leu Arg Asp Ala Trp Asn Trp Leu Asp Phe 
195 200 ' 205 
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Val Val lie Ala Leu Ala Tyr Val Thr Met Gly He Asp Leu Gly Asn 
210 215 220 

Leu Ala Ala Leu Arg Thr Phe Arg Val Leu Arg Ala Leu Lys Thr Val 
225 230 235 240 

Ala He Val Pro Gly Leu Lys Thr He Val Gly Ala Val He Glu Ser 
245 250 255 

Val Lys Asn Leu Arg Asp Val He He Leu Thr Met Phe Ser Leu Ser 
260 265 270 

Val Phe Ala Leu Met Gly Leu Gin He Tyr Met Gly Val Leu Thr Gin 
275 280 285 

Lys Cys He Lys Arg Phe Pro Leu Asp Gly Ser Trp Gly Asn Leu Thr 
290 295 300 

Asp Glu Asn Trp Phe Leu His Asn Ser Asn Ser Ser Asn Trp Phe Thr 
305 310 315 320 

Glu Asn Asp Gly Glu Ser Tyr Pro Val Cys Gly Asn Val Ser Gly Ala 
325 330 335 

Gly Gin Cys Gly Glu Asp Tyr Val Cys Leu Gin Gly Phe Gly Pro Asn 
340 345 350 

Pro Asn Tyr Asp Tyr Thr Ser Phe Asp Ser Phe Gly Trp Ala Phe Leu 
355 360 365 

Ser Ala Phe Arg Leu Met Thr Gin Asp Phe Trp Glu Asp Leu Tyr Gin 
370 375 380 

His Val Leu Gin Ala Ala Gly Pro Trp His Met Leu Phe Phe He Val 
385 390 395 400 

He He Phe Leu Gly Ser Phe Tyr Leu Val Asn Leu He Leu Ala lie 
405 410 415 

Val Ala Met Ser Tyr Asp Glu Leu Gin Lys Lys Ala Glu Glu Glu Glu 
420 425 430 

Ala Ala Glu Glu Glu Ala He Arg Glu Ala -Glu Glu Ala Ala Ala Ala 
435 440 445 

Lys Ala Ala Lys Leu Glu Glu Arg Ala Asn Val Ala Ala Gin Ala Ala 
450 455 460 



Gin Asp Ala Ala Asp Ala Ala Ala Ala Ala Leu His Pro Glu Met Ala 
465 470 475 480 

Lys Ser Pro Thr Tyr Ser Cys He Ser Tyr Glu Leu Phe Val Gly Gly 
485 490 495 

Glu Lys Gly Asn Asp Asp Asn Asn Lys Glu Lys Met Ser He Arg Ser 
500 505 510 

Val Glu Val Glu Ser Glu Ser Val Ser Val He Gin Arg Gin Pro Ala 
515 520 525 

Pro Thr Thr Ala Pro Ala Thr Lys Val Arg Lys Val Ser Thr Thr Ser 
530 535 540 

Leu Ser Leu Pro Gly Ser Pro Phe Asn Leu Arg Arg Gly Ser Arg Ser 
545 550 555 560 

Ser His Lys Tyr Thr He Arg Asn Gly Arg Gly Arg Phe Gly He Pro 
565 570 575 

Gly Ser Asp Arg Lys Pro Leu Val Leu Gin Thr Tyr Gin Asp Ala Gin 
580 585 590 

Gin His Leu Pro Tyr Ala Asp Asp Ser Asn Ala Val Thr Pro Met Ser 
595 600 605 

Glu Glu Asn Gly Ala He He Val Pro Ala Tyr Tyr Cys Asn Leu Gly 
610 615 620 

Ser Arg His Ser Ser Tyr Thr Ser His Gin Ser Arg He Ser Tyr Thr 
625 630 635 640 

Ser His Gly Asp Leu Leu Gly Gly Met Ala Ala Met Gly Ala Ser Thr 
645 650 655 

Met Thr Lys Glu Ser Lys Leu Arg Ser Arg Asn Thr Arg Asn Gin Ser 
660 665 670 

He Gly Ala Ala Thr Asn Gly Gly Ser Ser Thr Ala Gly Gly Gly Tyr 
675 680 685 

Pro Asp Ala Asn His Lys Glu Gin Arg Asp- Tyr Glu Met Gly Gin Asp 
690 695 700 

Tyr Thr Asp Glu Ala Gly Lys He -Lys His His Asp Asn Pro Phe He 
705 710 715 720 



Glu Pro Val Gin Thr Gin Thr Val Val Asp Met Lys Asp Val Met Val 
725 730 735 

Leu Asn Asp lie lie Glu Gin Ala Ala Gly Arg His Ser Arg Ala Ser 
740 745 750 

Glu Arg Gly Glu Asp Asp Asp Glu Asp Gly Pro Thr Phe Lys Asp lie 
755 760 765 

Ala Leu Glu Tyr lie Leu Lys Gly lie Glu lie Phe Cys Val Trp Asp 
770 '* 775 780 

Cys Cys Trp Val Trp Leu Lys Phe Gin Glu Trp Val Ser Phe lie Val 
785 790 ~ 795 800 

Phe Asp Pro Phe Val Glu Leu Phe lie Thr Leu Cys lie Val Val Asn 
805 810 * 815 

Thr Met Phe Met Ala Met Asp His His Asp Met Asn Pro Glu Leu Glu 
820 825 830 

Lys Val Leu Lys Ser Gly Asn Tyr Phe Phe Thr Ala Thr Phe Ala lie 
835 ' 840 845 . 

Glu Ala Ser Met Lys Leu Met Ala Met Ser Pro Lys Tyr Tyr Phe Gin 
850 855 860 

Glu Gly Trp Asn lie Phe Asp Phe lie lie Val Ala Leu Ser Leu Leu 
865 ' 870 " 875 880 

Glu Leu Gly Leu Glu Gly Val Gin Gly Leu Ser Val Leu Arg Ser Phe 
885 890 895 

Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn 
900 905 910 

Leu Leu lie Ser lie Met Gly Arg Thr Met Gly Ala Leu Gly Asn Leu 
915 920 925 

Thr Phe Val Leu Cys lie lie lie Phe lie Phe Ala Val Met Gly Met 
930 "* 935 940 

Gin Leu Phe Gly Lys Asn Tyr lie Asp His Lys Asp Arg Phe Lys Asp 
945 " 950 " 955 960 

His Glu Leu Pro Arg Trp Asn Phe Thr Asp Phe Met His Ser Phe Met 
965 970 975 



He Val Phe Arg Val Leu Cys Gly Glu Trp He Glu Ser Met Trp Asp 
980 985 990 

Cys Met Tyr Val Gly Asp Val Ser Cys He Pro Phe Phe Leu Ala Thr 
995 1000 1005 

Val Val He Gly Asn Phe Val Val Leu Asn Leu Phe Leu Ala Leu Leu 
1010 1015 1020 

Leu Ser Asn Phe Gly Ser Ser Ser Leu Ser Ala Pro Thr Ala Asp Asn 
1025 1030 1035 1040 

Asp Thr Asn Lys He Ala Glu Ala Phe Asn Arg He Ala Arg Phe Lys 
1045 1050 1055 

Asn Trp Val Lys Arg Asn He Ala Asp Cys Phe Lys Leu He Arg Asn 
1060 1065 1070 

Lys Leu Thr Asn Gin He Ser Asp Gin Pro Ser Glu His Gly Asp Asn 
1075 1080 1085 

Glu Leu Glu Leu Gly His Asp Glu He Met Gly Asp Gly Leu He Lys 
1090 1095 1100 

Lys Gly Met Lys Gly Glu Thr Gin Leu Glu Val Ala He Gly Asp Gly 
1105 1110 1115 1120 

Met Glu Phe Thr He His Gly Asp Met Lys Asn Asn Lys Pro Lys Lys 
1125 1130 1135 

Ser Lys Phe He Asn Asn Thr Thr Met He Gly Asn Ser He Asn His 
1140 1145 1150 

Gin Asp Asn Arg Leu Glu His Glu Leu Asn His Arg Gly Leu Ser He 
1155 1160 1165 

Gin Asp Asp Asp Thr Ala Ser He Asn Ser Tyr Gly Ser His Lys Asn 
1170 1175 1180 

Arg Pro Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr He Glu 
1185 1190 1195 1200 

Gly Glu Glu Lys Arg Asp Val Ser Lys Glu Asp Leu Gly Leu Asp Glu 
1205 1210 1215 

Glu Leu Asp Glu Glu Ala Glu Gly Asp Glu Gly Gin Leu Asp Gly Asp 
1220 1225 1230 
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He He He His Ala Gin Asn Asp Asp Glu He He Asp Asp Tyr Pro 
1235 1240 1245 

Ala Asp Cys Phe Pro Asp Ser Tyr Tyr Lys Lys Phe Pro He Leu Ala 
1250 1255 1260 

Gly Asp Glu Asp Ser Pro Phe Trp Gin Gly Trp Gly Asn Leu Arq Leu 
1265 1270 1275 ^ 1280 

Lys Thr Phe Gin Leu He Glu Asn Lys Tyr Phe Glu Thr Ala Val He 
1285 1290 1295 

Thr Met He Leu Met Ser Ser Leu Ala Leu Ala Leu Glu Asp Val His 
1300 1305 1310 

Leu Pro Asp Arg Pro Val Met Gin Asp He Leu Tyr Tyr Met Asp Arq 
1315 1320 1325 

He Phe Thr Val He Phe Phe Leu Glu Met Leu He Lys Trp Leu Ala 
1330 1335 1340 

Leu Gly Phe Lys Val Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp Phe 
1345 1350 1355 1360 

Val He Val Met Leu Ser Leu He Asn Leu Val Ala Val Trp Ser Gly 
1365 1370 1375 

Leu Asn Asp He Ala Val Phe Arg Ser Met Arg Thr Leu Arg Ala Leu 
1380 1385 1390 

Arg Pro Leu Arg Ala Val Ser Arg Trp Glu Gly Met Lys Val Val Val 
1395 1400 1405 

Asn Ala Leu Val Gin Ala He Pro Ser He Phe Asn Val Leu Leu Val 
1410 1415 1420 

Cys Leu He Phe Trp Leu He Phe Ala He Met Gly Val Gin Leu Phe 
1425 1430 1435 1440 

Ala Gly Lys Tyr Phe Lys Cys Lys Asp Gly Asn Asp Thr Val Leu Ser 
1445 1450 1455 

His Glu He He Pro Asn Arg Asn Ala Cys -Lys Ser Glu Asn Tyr Thr 
1460 1465 1470 

Trp Glu Asn Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu 
1475 1480 1485 



Cys Leu Phe Gin Val Ala Thr Phe Lys Gly Trp He Gin He Met Asn 
1490 1495 1500 

Asp Ala He Asp Ser Arg Glu Val Asp Lys Gin Pro He Arq Glu Thr 
1505 1510 1515 1520 

Asn He Tyr Met Tyr Leu Tyr Phe Val Phe Phe He He Phe Gly Ser 
1525 1530 1535 

Phe Phe Thr Leu Asn Leu Phe He Gly Val He He Asp Asn Phe Asn 
1540 1545 1550 

Glu Gin Lys Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu 
1555 1560 1565 

Asp Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys Met Gly Ser Lys Lvs 
1570 1575 1580 

Pro Leu Lys Ala He Pro Arg Pro Arg Trp Arg Pro Gin Ala He Val 
15 85 1590 1595 1600 

Phe Glu He Val Thr Asp Lys Lys Phe Asp He He He Met Leu Phe 
1605 1610 1615 

He Gly Leu Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser 
1620 1625 1630 

Glu Ala Tyr Asn Asn Val Leu Asp Lys Leu Asn Gly He Phe Val Val 
1635 1640 1645 

He Phe Ser Gly Glu Cys Leu Leu Lys He Phe Ala Leu Arg Tyr His 
1650 1655 1660 

Tyr Phe Lys Glu Pro Trp Asn Leu Phe Asp Val Val Val Val He Leu 
1665 1670 1675 1680 

Ser He Leu Gly Leu Val Leu Ser Asp He He Glu Lys Tyr Phe Val 
1685 1690 1695 

Ser Pro Thr Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val 
1700 1705 1710 

Leu Arg Leu Val Lys Gly Ala Lys Gly He Arg Thr Leu Leu Phe Ala 
1715 1720 1725 

Leu Ala Met Ser Leu Pro Ala Leu Phe Asn He Cys Leu Leu Leu Phe 
1730 1735 * 1740 



Leu Val Met Phe lie Phe Ala lie Phe Gly Met Ser Phe Phe Met His 
1745 1750 1755 1760 

Val Lys Glu Lys Ser Gly lie Asn Ala Val Tyr Asn Phe Lys Thr Phe 
1765 1770 1775 

Gly Gin Ser Met He Leu Leu Phe Gin Met Ser Thr Ser Ala Gly Trp 
1780 1785 1790 

Asp Gly Val Leu Asp Ala He He Asn Glu Glu Asp Cys Asp Pro Pro 
1795 1800 1805 

Asp Asn Asp Lys Gly Tyr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly 
1810 1815 1820 

He Thr Phe Leu Leu Ser Tyr Leu Val He Ser Phe Leu He Val He 
1825 1830 1835 1840 

Asn Met Tyr He Ala Val He Leu Glu Asn Tyr Ser Gin Ala Thr Glu 
1845 1850 1855 

Asp Val Gin Glu Gly Leu Thr Asp Asp Asp Tyr Asp Met Tyr Tyr Glu 
1860 1865 1870 

He Trp Gin Gin Phe Asp Pro Glu Gly Thr Gin Tyr He Arg Tyr Asp 
' 1875 1880 1885 

Gin Leu Ser Glu Phe Leu Asp Val Leu Glu Pro Pro Leu Gin He His 
1890 1895 1900 

Lys Pro Asn Lys Tyr Lys He He Ser Met Asp Met Pro He Cys Arg 
1905 1910 1915 1920 

Gly Asp Met Met Tyr Cys Val Asp He Leu Asp Ala Leu Thr Lys Asp 
1925 1930 1935 

Phe Phe Ala Arg Lys Gly Asn Pro He Glu Glu Thr Gly Glu He Gly 
1940 1945 1950 

Glu lie Ala Ala Arg Pro Asp Thr Glu Gly Tyr Asp Pro Val Ser Ser 
1955 1960 1965 

Thr Leu Trp Arg Gin Arg Glu Glu Tyr Cys • Ala Lys Leu lie Gin Asn 
1970 1975 1980 

Ala Trp Arg Arg Tyr Lys Asn Gly Pro Pro Gin Glu Gly Asp Glu Gly 
1985 1990 * 1995 2000 



Glu Ala Ala Gly Gly Glu Asp Gly Ala Glu Gly Gly Glu Gly Glu Glv 
2005 2010 2015 

Gly Ser Gly Gly Gly Gly Asp Asp Asp Gly Gly Ser Ala Thr Ala Ala 
2020 2025 2030 

Gly Ala Thr Ser Pro Thr Asp Pro Asp Ala Gly Glu Ala Asp Gly Ala 
2035 2040 2045 

Ser Ala Gly Asn Gly Gly Gly Pro Leu Ser Pro Gly Cys Val Ser Glv 
2050 2055 2060 

Gly Ser Asn Gly Arg Gin Thr Ala Val Leu Val Glu Ser Asp Gly Phe 
2065 2070 2075 2080 

Val Thr Lys Asn Gly His Lys Val Val He His Ser Arg Ser Pro Ser 
2085 2090 2095 

He Thr Ser Arg Thr Ala Asp Val 
2100 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CGGTTGGGCT TTCCTGTC 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



o 
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GGGAATTCRA ADATRTTCCA NCCYTC 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
CCCGARGAYA THGAYCYNTA YTA 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 
SPGTATCGCCT CCTCCTCG 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 



GGGTCTAGAT HTTYGCNATH TTYGGNATG 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGGGAATTCN GGRTCRAAYT GYTGCCA 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGGT CTAGAR GANCARAARA ARTAYTA 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
TCATACTTTG GCCCAATGTC 
(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
CCCGAATTAG AGAAGGTGCT G 
(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

I (C) STRANDEDNESS: single 

% (D) TOPOLOGY: linear 

J (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

; ACTATTGCTT GTGGTCGCCA C 

12) INFORMATION FOR SEQ ID NO: 15: 

5 (i) SEQUENCE CHARACTERISTICS: 

;:! (A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
CATCNTTRGC NGCNTAGACN ATGAC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GATTGAATGG ATCGAGCAGC C 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CGTTTCTCCT TTCATATCTA G 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GGAGBGGBGG NCKBGGNCKN GCTCA 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not releYant 

(D) TOPOLOGY: linear 

, (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Thr Glu Asp Ser Asp Ser lie Ser Glu Glu Glu Arg Ser Leu Phe 
15 10 15 

Arg Pro Phe Thr Arg Glu Ser Leu Val Gin lie Glu Gin Arg lie Ala 
20 25 30 

Ala Glu His Glu Lys Gin Lys Glu Leu Glu Arg Lys Arg Ala Glu Gly 
35 40 45 

Glu Val Pro Arg Tyr Gly Arg Lys Lys Lys Gin Lys Glu lie Arg Tyr 
50 55 60 

Asp Asp Glu Asp Glu Asp Glu Gly Pro Gin Pro Asp Pro Thr Leu Glu 
65 70 75 80 

Gin Gly Val Pro lie Pro Val Arg Leu Gin Gly Ser Phe Pro Pro Glu 
85 90 95 

Leu Ala Ser Thr Pro Leu Glu Asp lie Asp Pro Tyr Tyr Ser Asn Val 
100 105 110 

Leu Thr Phe Val Val Val Ser Lys Gly Lys Asp lie Phe Arg Phe Ser 
115 120 125 

Ala Ser Lys Ala Met Trp Met Leu Asp Pro Phe Asn Pro lie Arg Arg 
130 135 - 140 

Val Ala lie Tyr lie Leu Val His Pro Leu Phe Ser Leu Phe lie lie 
145 150 155 160 

Thr Thr lie Leu Val Asn Cys lie Leu Met lie Met Pro Thr Thr Pro 
165 170 175 

Thr Val Glu Ser Thr Glu Val lie Phe Thr Gly lie Tyr Thr Phe Glu 
180 185 190 

Ser Ala Val Lys Val Met Ala Arg Gly Phe lie Leu Cys Pro Phe Thr 
195 200 205 

Tyr Leu Arg Asp Ala Trp Asn Trp Leu Asp Phe Val Val lie Ala Leu 
210 215 220 

Ala Tyr Val Thr Met Gly lie Asp Leu Gly Asn Leu Ala Ala Leu Arg 
225 230 235 240 



Thr Phe Arg Val Leu Arg Ala Leu Lys Thr Val Ala lie Val Pro Gly 
245 250 255 

Leu Lys Thr lie Val Gly Ala Val lie Glu Ser Val Lys Asn Leu Arg 
260 265 270 

Asp Val lie He Leu Thr Met Phe Ser Leu Ser Val Phe Ala Leu Met 
275 280 285 

Gly Leu Gin He Tyr Met Gly Val Leu Thr Glu Lys Cys He Lys Lys 
290 295 300 

Phe Pro Leu Asp Gly Ser Trp Gly Asn Leu Thr Asp Glu Asn Trp Asp 
305 310 315 320 

Tyr His Asn Arg Asn Ser Ser Asn Trp Tyr Ser Glu Asp Glu Gly He 
325 330 335 

Ser Phe Pro Leu Cys Gly Asn He Ser Gly Ala Gly Gin Cys Asp Asp 
340 345 350 

Asp Tyr Val Cys Leu Gin Gly Phe Gly Pro Asn Pro Asn Tyr Gly Tyr 
355 360 365 

Thr Ser Phe Asp Ser Phe Gly Trp Ala Phe Leu Ser Ala Phe Arg Leu 
370 375 380 

Met Thr Gin Asp Phe Trp Glu Asp Leu Tyr Gin Leu Val Leu Arg Ala 
385 390 395 400 

Ala Gly Pro Trp His Met Leu Phe Phe He Val He He Phe Leu Gly 
405 410 415 

Ser Phe Tyr Leu Val Asn Leu He Leu Ala He Val Ala Met Ser Tyr 
420 425 430 

Asp Glu Leu Gin Arg Lys Ala Glu Glu Glu Glu Ala Ala Glu Glu Glu 
435 440 445 

Ala He Arg Glu Ala Glu Glu Ala Ala Ala Ala Lys Ala Ala Lys Leu 
450 455 460 

Glu Glu Arg Ala Asn Ala Gin Ala^Gln Ala Ala Ala Asp Ala Ala Ala 
465 470 475 480 

Ala Glu Glu Ala Ala Leu His Pro Glu Met Ala Lys Ser Pro Thr Tyr 



Ser Cys lie Ser Tyr Glu Leu Phe Val Gly Gly Glu Lys Gly Asn Asp 
500 505 510 

Asp Asn Asn Lys Glu Lys Met Ser lie Arg Ser Val Glu Val Glu Ser 
515 520 525 

Glu Ser Val Ser Val lie Gin Arg Gin Pro Ala Pro Thr Thr Ala His 
530 535 540 

Gin Ala Thr Lys Val Arg Lys Val Ser Thr Thr Ser Leu Ser Leu Pro 
545 550 555 560 

Gly Ser Pro Phe Asn lie Arg Arg Gly Ser Arg Ser Ser His Lys Tyr 
565 570 575 

Thr lie Arg Asn Gly Arg Gly Arg Phe Gly lie Pro Gly Ser Asp Arg 
580 585 590 

Lys Pro Leu Val Leu Ser Thr Tyr Gin Asp Ala Gin Gin His Leu Pro 
595 600 605 

Tyr Ala Asp Asp Ser Asn Ala Val Thr Pro Met Ser Glu Glu Asn Gly 
610 615 620 

Ala lie lie Val Pro Val Tyr Tyr Gly Asn Leu Gly Ser Arg His Ser 
625 630 635 640 

Ser Tyr Thr Ser His Gin Ser Arg lie Ser Tyr Thr Ser His Gly Asp 
645 650 655 

Leu Leu Gly Gly Met Ala Val Met Gly Val Ser Thr Met Thr Lys Glu 
660 665 670 

Ser Lys Leu Arg Asn Arg Asn Thr Arg Asn Gin Ser Val Gly Ala Thr 
675 680 685 

Asn Gly Gly Thr Thr Cys Leu Asp Thr Asn His Lys Leu Asp His Arg 
690 695 700 

Asp Tyr Glu lie Gly Leu Glu Cys Thr Asp Glu Ala Gly Lys lie Lys 
705 710 715 720 

His His Asp Asn Pro Phe lie Glu Pro Val Gin Thr Gin Thr Val Val 
725 730 735 



Asp Met Lys Asp Val 
740 

Gly Arg His Ser Arg 
755 

Gly Pro Thr Phe Lys 
770 

Asp Val Phe Cys Val 
785 

Glu Trp Val Ser Leu 
805 

Thr Leu Cys lie Val 
820 

Asp Met Asn Lys Glu 
835 

Phe Thr Ala Thr Phe 
850 

Ser Pro Lys Tyr Tyr 
865 

lie Val Ala Leu Ser 
885 

Leu Ser Val Leu Arg 
900 

Lys Ser Trp Pro Thr 
915 

Met Gly Ala Leu Gly 
930 

lie Phe Ala Val Met 
945 



His Lys Asp Arg Phe 
965 



Met Val Leu Asn Asp lie 
745 



Ala Ser Asp Arg Gly Glu 
760 



Asp Lys Ala Leu Glu Val 
775 



Trp Asp Cys Cys Trp Val 
790 795 



lie Val Phe Asp Pro Phe 
810 



Val Asn Thr Met Phe Met 
825 



Met Glu Arg Val Leu Lys 
840 



Ala He Glu Ala Thr Met 
855 



Phe Gin Glu Gly Trp Asn 
870 875 

Leu Leu Glu Leu Gly Leu 
890 



Ser Phe Arg Leu Leu Arg 
905 



Leu Asn Leu Leu He Ser 
920 



Asn Leu Thr Phe Val Leu 
935 

Gly Met Gin Leu Phe Gly 

950 955 



Pro Asp Gly Asp Leu Pro 
970 



He Glu Gin Ala Ala 
750 

Asp Asp Asp Glu Asp 
765 

He Leu Lys Gly He 
780 

Trp Leu Lys Phe Gin 

800 

Val Glu Leu Phe He 
815 

Ala Met Asp His His 
830 

Ser Gly Asn Tyr Phe 
845 

Lys Leu Met Ala Met 
860 

He Phe Asp Phe He 
880 

Glu Gly Val Gin Gly 

895 

Val Phe Lys Leu Ala 
910 

He Met Gly Arg Thr 
925 

Cys He He He Phe 
940 

Lys Asn Tyr His Asp 
960 



Arg Trp Asn Phe Thr 
975 



Asp Phe Met His Ser Phe Met He Val Phe Arg Val Leu Cys Gly Glu 
980 985 - 990 
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Trp lie Glu Ser Met Trp Asp Cys Met Tyr Val Gly Asp Val Ser Cys 
995 1000 1005 

lie Pro Phe Phe Leu Ala Thr Val Val lie Gly Asn Leu Val Val Leu 
1010 1015 1020 

Asn Leu Phe Leu Ala Leu Leu Leu Ser Asn Phe Gly Ser Ser Ser Leu 
1025 1030 1035 1040 

Ser Ala Pro Thr Ala Asp Asn Asp Thr Asn Lys He Ala Glu Ala Phe 
1045 1050 1055 

Asn Arg He Gly Arg Phe Lys Ser Trp Val Lys Arg Asn He Ala Asp 
1060 1065 1070 

Cys Phe Lys Leu He Arg Asn Lys Leu Thr Asn Gin He Ser Asp Gin 
1075 1080 1085 

Pro Ser Glu His Gly Asp Asn Glu Leu Glu Leu Gly His Asp Glu He 
1090 1095 1100 

Leu Ala Asp Gly Leu He Lys Lys Gly He Lys Glu Gin Thr Gin Leu 
1105 1110 1115 1120 

Glu Val Ala He Gly Asp Gly Met Glu Phe Thr He His Gly Asp Met 
1125 1130 1135 

Lys Asn Asn Lys Pro Lys Lys Ser Lys Tyr Leu Asn Asn Ala Thr Asp 
1140 1145 1150 

Asp Asp Thr Ala Ser He Asn Ser Tyr Gly Ser His Lys Asn Arg Pro 
1155 1160 1165 

Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr Met Glu Gly Glu 
1170 1175 1180 

Glu Lys Arg Asp Ala Ser Lys Glu Asp Leu Gly Leu Asp Glu Glu Leu 
1185 1190 1195 1200 

Asp Glu Glu Gly Glu Cys Glu Glu Gly Pro Leu Asp Gly Asp lie He 
1205 1210 1215 

He His Ala His Asp Glu Asp He Leu Asp Glu Tyr Pro Ala Asp Cys 
1220 1225 1230 

Cys Pro Asp Ser Tyr Tyr Lys Lys Phe Pro He Leu Ala Gly Asp Asp 
1235 1240 1245 



i 



Asp Ser Pro Phe Trp 
1250 

Arg Leu lie Glu Asp 
1265 

Leu Met Ser Ser Leu 
128! 

Arg Pro lie Leu Gin 
1300 

Val lie Phe Phe Leu 
1315 
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Gin Gly Trp Gly Asn Leu 
1255 

Lys Tyr Phe Glu Thr Ala 
1270 127! 

Ala Leu Ala Leu Glu Asp 
j 1290 

Asp lie Leu Tyr Tyr Met 
1305 

Glu Met Leu lie Lys Trp 
1320 



Arg Leu Lys Thr Phe 
1260 

Val lie Thr Met lie 
3 1280 

Val His Leu Pro Gin 
1295 

Asp Arg lie Phe Thr 
1310 

Leu Ala Leu Gly Phe 
1325 



Lys Val Tyr Leu Thr Asn Ala Trp Cys Trp Leu Asp Phe Val lie Val 
1330 1335 1340 

Met Val Ser Leu He Asn Phe Val Ala Ser Leu Val Gly Ala Gly Gly 
1345 1350 1355 1360 

He Gin Ala Phe Lys Thr Met Arg Thr Leu Arg Ala Leu Arg Pro Leu 
1365 1370 1375 

Arg Ala Met Ser Arg Met Gin Gly Met Arg Val Val Val Asn Ala Leu 
1380 1385 1390 

Val Gin Ala He Pro Ser He Phe Asn Val Leu Leu Val Cys Leu He 
1395 1400 1405 

Phe Trp Leu He Phe Ala He Met Gly Val Gin Leu Phe Ala Gly Lys 
1410 1415 1420 

Tyr Phe Lys Cys Glu Asp Met Asn Gly Thr Lys Leu Ser His Glu lie 
1425 1430 1435 1440 

He Pro Asn Arg Asn Ala Cys Glu Ser Glu Asn Tyr Thr Trp Val Asn 
1445 1450 1455 



Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu Cys Leu Phe 
1460 1465 1470 

Gin Val Ala Thr Phe Lys Gly Trp He Gin He Met Asn Asp Ala He 
1475 1480 1485 

Asp Ser Arg Glu Val Asp Lys Gin Pro He Arg Glu Thr Asn He Tyr 
1490 1495 1500 
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Met Tyr Leu Tyr Phe Val Phe Phe lie lie Phe Gly Ser Phe Phe Thr 
1505 1510 1515 1520 

Leu Asn Leu Phe lie Gly Val lie lie Asp Asn Phe Asn Glu Gin Lys 
1525 1530 1535 



Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu Asp Gin Lys 
1540 1545 1550 

Lys Tyr Tyr Ser Ala Met Lys Lys Met Gly Ser Lys Lys Pro Leu Lys 
1555 1560 1565 

Ala lie Pro Arg Pro Arg Trp Arg Pro Gin Ala lie Val Phe Glu lie 
1570 1575 1580 

Val Thr Asp Lys Lys Phe Asp lie lie lie Met Leu Phe lie Gly Leu 
1585 1590 1595 1600 

Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser Asp Thr Tyr 
1605 1610 1615 

Asn Ala Val Leu Asp Tyr Leu Asn Ala lie Phe Val Val lie Phe Ser 
1620 1625 1630 

Ser Glu Cys Leu Leu Lys lie Phe Ala Leu Arg Tyr His Tyr Phe lie 
1635 1640 1645 

Glu Pro Trp Asn Leu Phe Asp Val Val Val Val lie Leu Ser lie Leu 
1650 1655 1660 

Gly Leu Val Leu Ser Asp lie lie Glu Lys Tyr Phe Val Ser Pro Thr 
1665 1670 1675 1680 

Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val Leu Arg Leu 
1685 1690 1695 

Val Lys Gly Ala Lys Gly lie Arg Thr Leu Leu Phe Ala Leu Ala Met 
1700 1705 1710 

Ser Leu Pro Ala Leu Phe Asn lie Cys Leu Leu Leu Phe Leu Val Met 
1715 1720 1725 

Phe lie Phe Ala lie Phe Gly Met Ser Phe Phe Met His Val Lys Glu 
1730 1735 . 1740 

Lys Ser Gly lie Asn Asp Val Tyr Asn Phe Lys Thr Phe Gly Gin Ser 
1745 1750 1755 1760 



Met lie Leu Leu Phe Gin Met Ser Thr Ser Ala Gly Trp Asp Gly Val 
1765 1770 1775 

Leu Asp Ala lie lie Asn Glu Glu Ala Cys Asp Pro Pro Asp Asn Asp 
1780 1785 1790 

Lys Gly Tyr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly lie Thr Phe 
1795 1800 1805 

Leu Leu Ser Tyr Leu Val lie Ser Phe Leu lie Val lie Asn Met Tyr . 
1810 1815 1820 

lie Ala Val lie Leu Glu Asn Tyr Ser Gin Ala Thr Glu Asp Val Gin 
1825 1830 1835 1840 

Glu Gly Leu Thr Asp Asp Asp Tyr Asp Met Tyr Tyr Glu lie Trp Gin 
1845 1850 1855 

Gin Phe Asp Pro Glu Gly Thr Gin Tyr lie Arg Tyr Asp Gin Leu Ser 
I860 1865 1870 

Glu Phe Leu Asp Val Leu Glu Pro Pro Leu Gin lie His Lys Pro Asn 
1875 1880 1885 

Lys Tyr Lys lie He Ser Met Asp He Pro He Cys Arg Gly Asp Leu 
1890 1895 1900 

Met Tyr Cys Val Asp He Leu Asp Ala Leu Thr Lys Asp Phe Phe Ala 
1905 1910 1915 1920 

Arg Lys Gly Asn Pro He Glu Glu Thr Gly Glu He Gly Glu lie Ala 
1925 1930 1935 

Ala Arg Pro Asp Thr Glu Gly Tyr Glu Pro Val Ser Ser Thr Leu Trp 
1940 1945 1950 

Arg Gin Arg Glu Glu Tyr Cys Ala Arg Leu He Gin His Ala Trp Arg 
1955 1960 1965 

Lys His Lys Ala Arg Gly Glu Gly Gly Gly Ser Phe Glu Pro Asp Thr 
1970 1975 1980 

Asp His Gly Asp Gly Gly Asp Pro Asp Ala Gly Asp Pro Ala Pro Asp 
1985 1990 1995 2000 

Glu Ala Thr Asp Gly Asp Ala Pro Ala Gly Gly Asp Gly Ser Val Asn 
2005 2010 - 2015 
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Gly Thr Ala Glu Gly Ala Ala Asp Ala Asp Glu Ser Asn Val Asn Ser 
2020 2025 2030 

Pro Gly Glu Asp Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 
2035 2040 2045 

Ala Ala Gly Thr Thr Thr Ala Gly Ser Pro Gly Ala Gly Ser Ala Gly 
2050 2055 2060 

Arg Gin Thr Ala Val Leu Val Glu Ser Asp Gly Phe Val Thr Lys Asn 
2065 2070 2075 2080 

Gly His Lys Val Val lie His Ser Arg Ser Pro Ser lie Thr Ser Arg 
2085 2090 2095 

Thr Ala Asp Val 
2100 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid molecule encoding a 
voltage- sensitive sodium channel of Musca domes tica, 
wherein said voltage-sensitive sodium channel is capable 
of conferring sensitivity or resistance to an insecticide 
in Musca domes tica. 

2. The isolated nucleic acid molecule of claim 1 
wherein said nucleic acid is deoxyribonucleic acid. 

3 . The isolated nucleic acid molecule of claim 2 
wherein said deoxyribonucleic acid is cDNA. 

4 . The isolated nucleic acid molecule of claim 1 
wherein said voltage-sensitive sodium channel confers 
susceptibility to an insecticide in Musca domestica. 

5. The isolated nucleic acid molecule of claim 4 
wherein said nucleic acid molecule has a nucleotide 
sequence as shown in SEQ ID NO : 1 . 

6 . The isolated nucleic acid molecule of claim 4 
wherein said nucleic acid molecule encodes an amino acid 
sequence as shown in SEQ ID NO : 3 . 

7. The isolated nucleic acid molecule of claim 1 
wherein said voltage-sensitive sodium channel confers 
resistance to an insecticide in Musca domestica. 

8 . The isolated nucleic acid molecule of claim 7 
wherein said nucleic acid molecule has a nucleotide 
sequence as shown in SEQ ID NO : 2 . 
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9. The isolated nucleic acid molecule of claim 7 
wherein said nucleic acid molecule encodes an amino acid 
sequence as shown in SEQ ID NO : 4 . 

10 . The isolated nucleic acid molecule of claim 7 
wherein said nucleic acid molecule has the nucleotide 
sequence of a second nucleic acid molecule with one or 
more mutations therein, wherein said second nucleic acid 
molecule encodes an insecticide sensitive voltage- 
sensitive sodium channel of Musca domestica, and wherein 
said one or more mutations in said second nucleic acid 
molecule render the resulting voltage-sensitive sodium 
channel resistant to an insecticide. 

11. The isolated nucleic acid molecule of claim 10 
wherein said nucleotide sequence of said second nucleic 
acid molecule encodes amino acid SEQ ID NO : 3 , and wherein 
said one or more mutations in said second nucleic acid 
molecule are selected from the group consisting of a 
substitution for amino acid residue 1014 of SEQ ID NO : 3 , a 
substitution for amino acid residue 114 0 of SEQ ID NO : 3 , a 
substitution for amino acid residue 2023 of SEQ ID NO : 3 , a 
deletion of one or more of amino acid residues 2031-2034 
of SEQ ID NO : 3 , a substitution for amino acid residue 2 042 
of SEQ ID NO: 3, a substitution for amino acid residue 2054 
of SEQ ID N0:3, and an insertion of one to three amino 
acid residues between amino acid residues 2055 and 2 056 of 
SEQ ID NO : 3 . 

12 . The isolated nucleic acid molecule of claim 1 
wherein said nucleic acid is • ribonucleic acid. 



13 . The isolated nucleic acid molecule of claim 12 
wherein said ribonucleic acid is mRNA. 



) 
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14 . An antisense nucleic acid molecule 
complementary to at least a portion of the mRNA of 
claim 13 . 

15. An expression vector comprising the antisense 
nucleic acid molecule of claim 14. 

16. The expression vector of claim 15 wherein the 
expression vector is a baculovirus . 

17. A method of decreasing expression of a voltage- 
sensitive sodium channel in an insect, said method 
comprising infecting an insect with the baculovirus vector 
of claim 16, wherein infection of said insect by said 
baculovirus results in incorporation of said antisense 
nucleic acid molecule into the genome of said insect, 
thereby blocking expression of voltage-sensitive sodium 
channels in said insect cell. 

18 . A ribozyme having a recognition sequence 
complementary to a portion of the mRNA of claim 13 . 

19 . An expression vector comprising the ribozyme of 
claim 18 . 

20. The expression vector of claim 19 wherein the 
expression vector is a baculovirus. 

21. A method of decreasing expression of a voltage- 
sensitive sodium channel in an insect, said method 
comprising infecting an insect with the baculovirus vector 
of claim 20, wherein infection of said insect by said 
baculovirus results in expression of said ribozyme in said 
insect, thereby decreasing expression of voltage- sensit ive 
sodium channels in said insect cell . 



22. A cell comprising the nucleic acid molecule of 
claim 1 . 



23. The cell of claim 22 wherein the cell is a 
Xenopus oocyte . 

24. The cell of claim 22 wherein the cell is an 
insect cell line. 

25. The cell of claim 24 wherein said insect cell 
line is selected from the group consisting of a Drosophila. 
Schneider cell line, a Drosophila K c cell line, an Sf9 cell 
line, and a High Five® cell line. 

26. An expression vector comprising the nucleic 
acid molecule of claim 1 . 

27. The expression vector of claim 26 wherein said 
expression vector is selected from the group consisting of 
a plasmid and a virus . 

28. A cell comprising the expression vector of 
claim 26 . 

29. The cell of claim 28 wherein the cell is a 
Xenopus oocyte . 

30. The cell of claim 28 wherein the cell is an 
insect cell line. 

31. The cell of claim 3 0 wherein said insect cell 
line is selected from the group consisting of a Drosophila. 
Schneider cell line, a Drosophila K c cell line, an Sf9 cell 
line, and a High Five® cell line. 
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32. The isolated nucleic acid molecule of claim 1 
wherein said insecticide is selected from the group 
consisting of DDT, DDT analogs, and pyrethroids . 

33. A method of producing a voltage-sensitive 
sodium channel, said method comprising: 

introducing the nucleic acid molecule of 
claim 1 into a cell; and 

allowing said cell to express said nucleic acid 
molecule resulting in the production of a voltage- 
sensitive sodium channel in said cell. 

34. The method of claim 33 wherein the cell is a 
Xenopus oocyte . 

35. The method of claim 33 wherein the cell is an 
insect cell line. 

36. The method of claim 35 wherein said insect cell 
line is selected from the group consisting of a Drosophila 
Schneider cell line, a Drosophila. K c cell line, an Sf9 cell 
line, and a High Five® cell line. 

37. A method of producing a voltage-sensitive 
sodium channel, said method comprising: 

introducing the nucleic acid molecule of 
claim 1 and a second nucleic acid molecule encoding a tip 
E protein into a cell; and 

allowing said cell to coexpress said nucleic 
acid molecule and said second nucleic acid molecule, 
resulting in the production of a voltage-sensitive sodium 
channel in said cell. 

38. The method of claim 37 wherein the cell is a 
Xenopus oocyte . 
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39. The method of claim 37 wherein the cell is an 
insect cell line. 

40. The method of claim 39 wherein said insect cell 
line is selected from the group consisting of a Drosophila 
Schneider cell line, a Drosophila K 0 cell line, an Sf9 cell 
line, and a High Five® cell line. 

41. A method of screening a chemical agent for the 
ability of the chemical agent to modify sodium channel 
function, said method comprising: 

introducing the nucleic acid molecule of claim 1 
into a host cell; 

expressing said voltage- sensitive sodium channel 
encoded by said nucleic acid molecule in the host cell so 
as to result in the functional expression of a voltage - 
sensitive sodium channel in the host cell; 

exposing the cell to a chemical agent; and 

evaluating the exposed cell to determine if the 
chemical agent modifies the function of the voltage- 
sensitive sodium channel . 

42. ' The method of claim 41 wherein the cell is a 
Xenopus oocyte . 

43. The method of claim 41 wherein the cell is an 
insect cell line. 

44. The method of claim 43 wherein said insect cell 
line is selected from the group consisting of a Drosophila 
Schneider cell line, a Drosophila K c cell line, an Sf9 cell 
line, and a High Five® cell line. 

45. The method of claim 41 wherein said evaluation 
comprises monitoring sodium transport through said 
voltage-sensitive sodium channel. 
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46. The method of claim 41 wherein said evaluation 
comprises monitoring quanidinium transport through said 
voltage-sensitive sodium channel. 

47. A method of screening a chemical agent for the 
ability of the chemical agent to modify sodium channel 
function, said method comprising: 

introducing the nucleic acid molecule of claim 1 and 
a second nucleic acid molecule encoding a tip E protein 
into a host cell; 

allowing said host cell to coexpress said nucleic 
acid molecule and said second nucleic acid molecule so as 
to result in the functional expression of a voltage- 
sensitive sodium channel in the host cell; 

exposing the cell to a chemical agent; and 

evaluating the exposed cell to determine if the 
chemical agent modifies the function of the voltage- 
sensitive sodium channel . 

48. The method of claim 47 wherein the cell is a 
Xenopus oocyte . 

49. The method of claim 47 wherein the cell is an 
insect cell line. 

50. The method of claim 49 wherein said insect cell 
line is selected from the group consisting of a Drosophila. 
Schneider cell line, a Drosophila. K c cell line, an Sf9 cell 
line, and a High Five® cell line. 

51. The method of claim 47 wherein said evaluation 
comprises monitoring sodium transport through said 
voltage-sensitive sodium channel. 



52. The method of claim 47 wherein said evaluation 
comprises monitoring quanidinium transport through said 
voltage-sensitive sodium channel. 

53 . A method of obtaining DNA encoding a voltage- 
sensitive sodium channel, said method comprising: 

selecting a DNA molecule encoding a voltage- 
sensitive sodium channel of an insect, said DNA molecule 
having a nucleotide sequence selected from the group 
consisting of SEQ ID NO : 1 and SEQ ID NO: 2; 

designing an oligonucleotide probe for a voltage - 
sensitive sodium channel based on SEQ ID NO : 1 or SEQ ID 
NO : 2 ; 

probing a genomic or cDNA library of an insect with 
the oligonucleotide probe; and 

obtaining clones from said library that are 
recognized by said oligonucleotide probe, so as to obtain 
DNA encoding a voltage-sensitive sodium channel. 

54. A method of obtaining DNA encoding a voltage- 
sensitive sodium channel, said method comprising: 

selecting a DNA molecule encoding a voltage- 
sensitive sodium channel of an insect, said DNA molecule 
having a nucleotide sequence selected from the group 
consisting of SEQ ID NO : 1 and SEQ ID NO : 2 ; 

designing degenerate oligonucleotide primers 
based on SEQ ID NO : 1 or SEQ ID NO : 2 ; and 

utilizing said oligonucleotide primers in a 
polymerase chain reaction on a DNA sample to identify 
homologous DNA encoding a voltage-sensitive sodium channel 
in said sample. 

55 . An isolated nucleic acid molecule encoding a 
voltage-sensitive sodium channel of an insect, said 
nucleic acid molecule encoding a first amino acid sequence 



having at least 95% amino acid identity to a second amino 
acid sequence, said second amino acid sequence being as 
shown in SEQ ID NO : 3 . 

56. An isolated nucleic acid molecule encoding a 
voltage -sensitive sodium channel of an insect, said 
nucleic acid molecule encoding a first amino acid sequence 
having at least 95% amino acid identity to a second amino 
acid sequence, said second amino acid sequence being as 
shown in SEQ ID NO : 4 . 

57. An isolated voltage-sensitive sodium channel of 
Musca domestica, wherein said voltage-sensitive sodium 
channel is capable of conferring sensitivity or resistance 
to an insecticide in Musca domestica . 

58. The voltage-sensitive sodium channel of 
claim 57 wherein said voltage-sensitive sodium channel 
confers susceptibility to an insecticide in Musca 
domestica . 

59. The voltage-sensitive sodium channel of 
claim 58 wherein said voltage-sensitive sodium channel is 
encoded by a nucleotide sequence as shown in SEQ ID NO : 1 . 

60. The voltage-sensitive sodium channel of 
claim 58 wherein said voltage-sensitive sodium channel is 
comprised of a protein having an amino acid sequence as 
shown in SEQ ID NO : 3 . 

61. The voltage-sensitive sodium channel of 
claim 57 wherein said voltage-sensitive sodium channel 
confers resistance to an insecticide in Musca domestica . 



- 91 - 

62. The voltage -sensitive sodium channel of 
claim 61 wherein said voltage- sensitive sodium channel is 
encoded by a nucleotide sequence as shown in SEQ ID NO : 2 . 

63 . The voltage-sensitive sodium channel of 
claim 61 wherein said voltage- sensitive sodium channel is 
comprised of a protein having an amino acid sequence as 
shown in SEQ ID NO : 4 . 

64. The voltage-sensitive sodium channel of 
claim 61 wherein said voltage-sensitive sodium channel is 
encoded by a nucleic acid molecule having the nucleotide 
sequence of a second nucleic acid molecule with one or 
more mutations therein, wherein said second nucleic acid 
molecule encodes an insecticide sensitive voltage- 
sensitive sodium channel of Musca domes tica, and wherein 
said one or more mutations in said second nucleic acid 
molecule render the resulting voltage- sensitive sodium 
channel resistant to an insecticide. 

65. The voltage-sensitive sodium channel of 
claim 64 wherein said nucleotide sequence of said second 
nucleic acid molecule encodes amino acid SEQ ID NO : 3 , and 
wherein said one or more mutations in said second nucleic 
acid molecule are selected from the group consisting of a 
substitution for amino acid residue 1014 of SEQ ID NO : 3 , a 
substitution for amino acid residue 1140 of SEQ ID NO : 3 , a 
substitution for amino acid residue 2023 of SEQ ID N0:3, a 
deletion of one or more of amino acid residues 2031-2034 
of SEQ ID NO : 3 , a substitution for amino acid residue 2042 
of SEQ ID NO: 3, a substitution for amino acid residue 2054 
of SEQ ID NO: 3, and an insertion of one to three amino 
acid residues between amino acid residues 2055 and 2056 of 
SEQ ID NO : 3 . 



66. The voltage-sensitive sodium channel of 
claim 57 wherein said insecticide is selected from the 
group consisting of DDT, DDT analogs, and pyrethroids . 



67. An antibody or fragment thereof specific for 
the voltage-sensitive sodium channel of claim 57. 

68. The antibody of claim 67 wherein said antibody 
comprises a monoclonal antibody. 

69. The antibody of claim 67 wherein said antibody 
comprises a polyclonal antibody. 

70. A method of detecting presence of a voltage- 
sensitive sodium channel in a sample, said method 
comprising : 

contacting a sample with the antibody or fragment 
thereof of claim 67, wherein said antibody or fragment 
thereof binds to any of said voltage-sensitive sodium 
channel present in said sample, forming a complex 
therewith; and 

detecting said complex, thereby detecting presence 
of a voltage-sensitive sodium channel in said sample. 

71. An isolated voltage- sensitive sodium channel of 
Musca domestical, wherein the voltage-sensitive sodium 
channel is comprised of a protein having a first amino 
acid sequence with at least 95% amino acid identity to a 
second amino acid sequence, said second amino acid 
sequence being as shown in SEQ ID NO : 3 . 

72. An isolated voltage-sensitive sodium channel of 
Musca. domestica., wherein the voltage-sensitive sodium 
channel is comprised of a protein having a first amino 
acid sequence with at least 95% amino acid identity to a 
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second amino acid sequence, said second amino acid 
sequence being as shown in SEQ ID N0:4. 

73 . A plasmid designated pPJIl and deposited with 
the American Type Culture Collection under Accession No. 



74. A Kpnl/Aatll restriction fragment of the 
plasmid designated pPJIl of claim 73, said restriction 
fragment being about 3 62 0 bp. 

75 . A plasmid designated pPJI2 and deposited with 
the American Type Culture Collection under Accession No. 



76. An Aatll/Sphll restriction fragment of the 
plasmid designated pPJI2 of claim 75, said restriction 
fragment being about 2700 bp. 

77. An isolated nucleic acid molecule consisting of 
a Kpnl/Aatll restriction fragment of about 3 620 bp of the 
plasmid designated pPJIl ligated at the Aatll site to the 
Aatll site of an Aatll/Sphll restriction fragment of about 
2700 bp of the plasmid " designated pPJI2 . 



) 
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INSECT SODIUM CHANNELS FROM INSECTICIDE -SUSCEPTIBLE 
AND INSECTICIDE -RESISTANT HOUSE FLIES 



5 ABSTRACT OF THE DISCLOSURE 

The present invention is directed to isolated 
nucleic acid molecules encoding a voltage-sensitive sodium 
channel (VSSC) of Musca domestica, the VSSC being capable 

10 of conferring insecticide susceptibility or insecticide 

resistance to Musca domestica, as well as to the isolated 
voltage- sensitive sodium channels of Musca domestica 
encoded thereby. Nucleic acid molecules encoding 
insecticide susceptible VSSCs and nucleic acid molecules 

15 encoding insecticide resistant VSSCs are provided. 

Methods for increasing or decreasing the expression of 
functional voltage- sensitive sodium channels in host cells 
are also provided, as well as methods using the sodium 
channels. Also provided is a method for isolating other 

20 voltage- sensitive sodium channels. 
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<210> 1 
<211> 6318 
<212> DNA 
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atgacagaag attccgactc gatatctgag gaagaacgca gtttgttccg tcccttcacc 60 
cgcgaatcat tgttacaaat cgaacaacgt atcgctgaac atgaaaaaca aaaggagctg 12 0 
gaaagaaaga gagccgccga aggagagcag atacgatatg atgacgagga cgaagatgaa 18 0 
ggtccacagc cggatcccac acttgaacag ggtgtgccta tacctgttcg aatgcagggc 24 0 
agcttcccgc cggaattggc ctccactcct ctcgaggata tcgatccctt ctacagtaat 300 
gtactgacat ttgtagtaat aagtaaagga aaggatattt ttcgtttttc tgcctcaaaa 360 
gcaatgtggc tgctcgatcc attcaatccg atacgtcgtg tagccattta tattttagtg 42 0 
catcccttgt tttcgttatt cattatcacc actattctaa ctaattgtat tttaatgata 480 
atgccgacaa cgcccacggt cgaatccaca gaggtgatat tcaccggaat ctacacattt 54 0 
gaatcagctg ttaaagtgat ggcacgaggt ttcattttat gcccgtttac gtatcttaga 600 
gatgcatgga attggctgga cttcgtagta atagctttag cttatgtgac catgggcata 66 0 
gatttaggta atctcgcagc tttgagaaca tttagggtac tgcgagctct gaaaaccgta 72 0 
gccattgtgc caggtctaaa aaccattgtc ggtgctgtca ttgaatctgt aaaaaatcta 780 
cgcgatgtga taattttgac aatgttttcc ctgtcggtgt tcgcgctgat gggcctacaa 84 0 
atctatatgg gtgttctaac acaaaagtgc attaaacgat tccccctgga cggcagttgg 900 
ggcaatctga ccgatgaaaa ctggtttcta cacaatagca acagttccaa ttggtttacg 96 0 
gagaacgatg gcgagtcata tccggtgtgc gggaatgtat ccggtgcggg acaatgcggc 102 0 
gaggattacg tctgcctgca gggcttcggc cccaatccca actacgacta caccagtttc 1080 
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gattcattcg gttgggcttt cctgtcggcg tttcgtctca tgacccaaga tttctgggag 1140 
gatctgtatc agcacgtgct gcaagcagct ggaccctggc acatgttgtt ctttatagtc 12 00 
atcatcttcc taggttcatt ctatcttgtg aatttgattt tggccattgt tgccatgtct 1260 
tatgacgaat tgcaaaagaa ggccgaagaa gaagaggctg ccgaggagga ggcgatacga 132 0 
gaagctgaag aagcggcagc agccaaggcg gccaaactgg aggagcgggc caatgtagca 13 8 0 
gctcaagcgg ctcaggatgc agcggatgcc gctgcggcag ctctgcatcc cgagatggca 144 0 
aagagtccca cgtactcttg cattagctat gaactgtttg ttggcggcga gaagggcaac 1500 
gatgacaaca acaaagagaa gatgtccata cgcagcgtcg aagtggaatc ggagtcggtg 156 0 
agcgttatac aaagacaacc agcacctacc acagcacccg ctactaaagt ccgtaaagtt 162 0 
agcacgactt ccttatcctt acctggttca ccatttaacc tacgccgggg atcacgtagt 16 8 0 
tcacacaagt acacaatacg aaatgggcgt ggacgttttg gtataccagg tagcgatcgc 1740 
aagccattgg tactgcaaac atatcaggat gcccagcagc atttgcccta tgccgatgac 1800 
tcgaatgccg taacaccaat gtccgaagag aatggtgcca ttatagtacc agcctactat 1860 
tgtaatttag gttctagaca ttcttcatat acctcgcatc aatcaagaat ctcgtataca 1920 
tcacatggtg atttattggg tggcatggcg gccatgggtg ccagcacaat gaccaaagag 1980 
agcaaattgc gcagtcgcaa cacacgcaat caatcaatcg gtgctgcaac caatggtggc 2 040 
agtagtacgg ctggtggtgg ctatcccgat gccaatcaca aggaacaaag ggattatgaa 2100 
atgggtcagg attatacaga cgaagctggc aaaataaaac accacgacaa tccttttatc 2160 
gagcccgtcc aaactcaaac agtggtagac atgaaagatg ttatggtctt aaatgatatc 2220 
attgaacaag ccgctggtcg gcatagtcgt gctagtgaac gaggtgagga cgatgacgaa 2280 
gatggtccca cattcaagga catcgccctc gaatacatcc taaaaggcat cgaaatcttt 2340 
tgtgtatggg actgttgttg ggtgtggtta aaatttcagg aatgggtgtc ctttattgtg 2400 
ttcgatccat tcgtggagct cttcattacc ctgtgtattg tggtcaatac gatgtttatg 2460 
gccatggatc atcacgacat gaatccggaa ttagagaagg tgctgaaaag tggtaactat 2520 
ttcttcacgg ccacttttgc aattgaagcc agcatgaaac tgatggccat gagcccgaag 2580 
tactacttcc aggaaggctg gaacattttc gatttcatta ttgtggcctt gtctctgctg 2640 
gaattgggcc tggagggtgt ccagggcctg tcggtgttga gaagttttcg tttgcttcgt 2700 
gtattcaaat tggcaaaatc atggcccaca ctcaatttac tcatttcgat tatgggccgg 2760 
acaatgggtg cattgggtaa tctgacattt gtactttgca ttatcatctt catctttgcc 2820 
gtgatgggaa tgcaactttt cggaaagaac tatattgacc acaaggatcg cttcaaggac 2880 
catgaattac cgcgctggaa cttcaccgac ttcatgcaca gcttcatgat tgtgttccga 2940 
gtgctgtgcg gagagtggat cgagtccatg tgggactgca tgtatgtggg cgatgtcagc 3 0 00 
tgtataccct tcttcttggc cacggtcgtg ataggcaatc ttgtggttct taatcttttc 3060 
ttagctttgc ttttgtccaa cttcggttca tctagtttat cagccccgac tgccgacaat 3120 
gataccaata aaatagcaga ggccttcaat cgtattgctc gttttaagaa ctgggtgaaa 3180 
cgtaatattg ccgattgttt taagttaatt cgaaataaat tgacaaatca aataagtgac 3240 
caaccatcag aacatggcga taatgaactg gagttgggtc atgacgaaat catgggcgat 33 00 
ggcttgatca aaaagggtat gaagggcgag acccagctgg aggtggccat tggcgatggc 33 60 
atggagttca cgatacatgg cgatatgaaa aacaacaagc cgaagaaatc aaaattcatg 3420 
aacaacacaa cgatgattgg aaactcaata aaccaccaag acaatagact ggaacatgag 3480 
ctaaaccata gaggtttgtc catacaggac gatgacactg ccagcattaa ctcatatggt 3540 
agccataaga atcgaccatt caaggacgag agccacaagg gcagcgccga gaccatcgag 3600 
ggcgaggaga aacgcgacgt cagcaaagag gacctcggcc tcgacgagga actggacgag 3660 
gaggccgagg gcgatgaggg ccagctggat ggtgacatta tcattcatgc gcaaaacgac 3720 
gacgagataa tcgacgacta tccggccgac tgtttccccg actcgtacta caagaagttt 3780 
ccgatcttgg ccggcgacga ggactcgccg ttctggcaag gatggggcaa tttacgactg 3840 
aaaacttttc aattaattga aaataaatat tttgaaaccg cagttatcac tatgatttta 3900 
atgagtagct tagctttggc cttagaagat gttcatttac ccgatcgacc tgtcatgcag 3960 
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gatatactgt actacatgga caggatattt acggtgatat tctttttgga gatgttgatc 4020 
aaatggttgg ccctgggctt taaggtttac ttcaccaatg cctggtgttg gctggatttc 4080 
gtgattgtca tgctatcgct tataaatttg gttgccgttt ggtcgggctt aaatgatata 4140 
gccgtgttta gatcaatgcg cacactgcgc gccctaaggc cattgcgtgc tgtctctaga 42 00 
tgggagggta tgaaagttgt cgtgaatgcg ctggttcaag ctataccgtc catcttcaat 4260 
gtgctattgg tgtgtctgat attttggctt atttttgcca ttatgggagt acagcttttt 4320 
gctggaaaat attttaagtg taaagatggt aatgacactg tgctgagcca tgaaatcata 43 8 0 
ccgaatcgta atgcctgcaa aagtgaaaac tacacctggg aaaattcggc aatgaacttc 4440 
gatcatgtag gtaatgcgta tctctgtcta tttcaagtgg ccacctttaa gggctggatc 4500 
cagattatga acgatgccat tgattcacga gaggtggaca agcagccgat ccgagaaacc 4560 
aatatctaca tgtatttata tttcgtattc ttcattatat ttggatcatt tttcacactc 4620 
aatctgttca ttggtgttat cattgataat tttaatgaac aaaagaagaa agctggtgga 46 80 
tcattagaaa tgttcatgac agaagatcag aaaaagtact ataatgctat gaaaaagatg 4740 
ggctctaaaa aaccattaaa agccattcca agaccgaggt ggcgaccaca agcaatagta 48 00 
ttcgaaatag ttacagataa aaaattcgat ataatcatta tgttgttcat tggcttaaac 4860 
atgtttacca tgaccctcga tcggtacgac gcctccgagg cgtacaacaa tgtcctcgac 4920 
aaactcaatg ggatattcgt agttattttc agtggcgaat gtctattaaa aatattcgct 4980 
ttacgatatc actatttcaa agagccatgg aatttatttg atgtagtagt tgtcatttta 5040 
tccatcttag gtcttgtact cagcgacatc attgagaagt atttcgtatc gccgacactg 5100 
ctccgtgtgg tgagagtggc caaagtgggt cgtgtcctgc gtttagtcaa gggtgccaag 5160 
ggtatccgga cgttgctgtt cgcgttagcc atgtcgttgc ctgccttatt caacatttgt 5220 
ctgttgctgt tcttggtgat gttcatcttt gctatctttg gcatgtcctt cttcatgcat 5280 
gtcaaagaga agagcggcat aaatgctgtg tataatttta agacatttgg ccaaagtatg 5340 
atattgctgt ttcagatgtc tacctcagcc ggttgggatg gtgtgttaga tgccattatc 54 00 
aatgaggaag attgcgatcc acccgacaac gacaagggct atccgggcaa ttgtggttca 5460 
gcgactgttg gaattacgtt tctcctttca tatctagtta taagcttttt gatagttatt 5520 
aatatgtaca ttgctgtcat tctcgagaac tatagccagg ctacggagga tgtacaggag 5580 
ggtctcaccg acgacgatta cgatatgtac tacgagattt ggcaacaatt cgatccggag 5640 
ggcacccagt acatacgcta cgaccagctg tccgagtttc tggacgtgct ggagccgccg 5 7 00 
ctgcagatcc acaagccgaa caagtacaaa atcatatcga tggacatgcc gatatgtcgg 5760 
ggcgacatga tgtactgtgt ggatatattg gatgccctga ccaaggactt ctttgcgcgc 5820 
aagggtaatc cgatcgagga gacgggtgaa attggtgaga tagcggcgcg accggacacc 58 80 
gagggctatg atccggtgtc gtcaacactg tggcgccagc gtgaggagta ctgcgccaag 5940 
ctgatacaga atgcgtggcg gcgttacaag aatggcccac cccaggaggg tgatgagggc 6000 
gaggcggctg gtggcgaaga tggtgctgaa ggcggtgagg gtgaaggagg cagcggcggc 6 060 
ggcggcggtg atgatggtgg ctcagcgaca ggagcaacgg cggcggcggg agccacatca 6120 
ccctcagatc cagatgccgg cgaagcagat ggtgccagcg tcggcggccc ccttagtccg 6180 
ggctgtgtta gtggcggcag taatggccgc caaacggccg tactggtcga aagcgatggt 6240 
tttgttacaa aaaacggtca taaggttgta atacactcga gatcgccgag cataacatcc 63 00 
aggacggcag atgtctga 6 318 

<210> 2 
<211> 6315 
<212> DNA 

<213> Musca domestica 
<400> 2 

atgacagaag attccgactc gatatctgag gaagaacgca gtttgttccg tcccttcacc 6 0 
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cgcgaatcat tgttacaaat cgaacaacgt 
gaaagaaaga gagccgccga aggagagcag 
ggtccacagc cggatcccac acttgaacag 
agcttcccgc cggaattggc ctccactcct 
gtactgacat ttgtagtaat aagtaaagga 
gcaatgtggc tgctcgatcc attcaatccg 
catcccttgt tttcgttatt cattatcacc 
atgccgacaa cgcccacggt cgaatccaca 
gaatcagctg ttaaagtgat ggcacgaggt 
gatgcatgga attggctgga cttcgtagta 
gatttaggta atctcgcagc tttgagaaca 
gccattgtgc caggtctaaa aaccattgtc 
cgcgatgtga taattttgac aatgttttcc 
atctatatgg gtgttctaac acaaaagtgc 
ggcaatctga ccgatgaaaa ctggtttcta 
gagaacgatg gcgagtcata tccggtgtgc 
gaagattacg tctgcctgca gggcttcggc 
gactcattcg gttgggcttt cctgtcggcg 
gatctgtatc agcacgtgct gcaagcagct 
atcatcttcc taggttcatt ctatcttgtg 
tatgacgaat tgcaaaagaa ggccgaagaa 
gaagctgaag aagcggcagc agccaaggcg 
gctcaagcgg ctcaggatgc agcggatgcc 
aagagtccca cgtactcttg cattagctat 
gatgacaaca acaaggagaa gatgtcgata 
agcgttatac aaagacaacc agcacctacc 
agcacgactt ccttatcctt acctggttca 
tcacacaagt acacaatacg aaatgggcgt 
aagccattgg tactgcaaac atatcaggat 
tcgaatgccg taacaccaat gtccgaagag 
tgtaatttag gttctagaca ttcttcatat 
tcacatggtg atttattggg tggcatggcg 
agcaaattgc gcagtcgcaa cacacgcaat 
agtagtacgg ccggtggtgg ctatcccgat 
atgggtcagg attatacaga cgaagctggc 
gagcccgtcc aaactcaaac agtggtagac 
attgaacaag ccgctggtcg gcatagtcgt 
gatggtccca cattcaagga catcgccctc 
tgtgtatggg actgttgttg ggtgtggtta 
ttcgatccat tcgtggagct cttcattacc 
gccatggatc atcacgacat gaatccggaa 
ttcttcacgg ccacttttgc aattgaggcc 
tactacttcc aggaaggctg gaacattttc 
gaattgggcc tggagggtgt ccagggcctg 
gtattcaaat tggcaaaatc atggcccaca 
acaatgggtg cattgggtaa tctgacattt 
gtgatgggaa tgcaactttt cggaaagaac 
catgaattac cgcgctggaa tttcaccgac 



atcgctgaac atgaaaaaca aaaggagctg 12 0 
atacgatatg atgacgagga cgaagatgaa 18 0 
ggtgtgccta tacctgttcg aatgcagggc 24 0 
ctcgaggata tcgatccctt ctacagtaat 300 
aaggatattt ttcgtttttc tgcctcaaaa 36 0 
atacgtcgtg tagccattta tattttagtg 42 0 
actattctaa ctaattgtat tttaatgata 48 0 
gaggtgatat tcaccggaat ctacacattt 54 0 
ttcattttat gcccgtttac gtatcttaga 600 
atagctttag cttatgtgac catgggcata 66 0 
tttagggtac tgcgagctct gaaaaccgta 72 0 
ggtgctgtca ttgaatctgt aaaaaatcta 78 0 
ctgtcggtgt tcgcgctgat gggcctacaa 84 0 
attaaacgat tccccctgga cggcagttgg 900 
cacaatagca acagttccaa ttggtttacg 96 0 
gggaatgtat ccggtgcggg acaatgcggc 102 0 
cccaatccca actacgacta caccagtttc 1080 
tttcgtctca tgacccaaga tttctgggag 1140 
ggaccctggc acatgttgtt ctttatagtc 1200 
aatttgattt tggccattgt tgccatgtct 1260 
gaagaggctg ccgaggagga ggcgatccga 132 0 
gccaaactgg aggagcgggc caatgtagca 13 8 0 
gctgcggcag ctctgcatcc cgagatggca 1440 
gaactgtttg ttggcggcga gaagggcaac 1500 
cgcagcgtcg aagtggaatc ggagtcggtg 1560 
acagcacccg ctactaaagt ccgtaaagtt 1620 
ccatttaacc tacgccgggg atcacgtagt 1680 
ggacgttttg gtataccagg tagcgatcgc 1740 
gcccagcagc atttgcccta tgccgatgac 18 00 
aatggtgcca ttatagtacc agcctactat 1860 
acctcgcatc aatcaagaat ctcgtataca 1920 
gccatgggtg ccagcacaat gaccaaagag 1980 
caatcaatcg gtgctgcaac caatggtggc 2 040 
gccaatcaca aggaacaaag ggattatgaa 2100 
aaaataaaac accacgacaa tccttttatc 2160 
atgaaagatg ttatggtctt aaatgatatc 2220 
gctagtgaac gaggtgagga cgatgacgaa 2 2 80 
gaatatatcc taaaaggcat cgaaatcttt 2340 
aaatttcagg aatgggtctc ctttattgtg 2400 
ctgtgtattg tggtcaatac aatgttcatg 2460 
ttggagaagg tgctgaaaag tggtaactat 2520 
agcatgaaac tgatggccat gagcccgaag 2 580 
gatttcatta ttgtggcctt gtctctgctg 2640 
tcggtgttga gaagttttcg tttgcttcgt 2700 
ctgaatttac tcatttcgat tatgggccgg 2760 
gtactttgca ttatcatctt catctttgcc 2820 
tatattgacc acaaggatcg cttcaaggac 28 80 
ttcatgcaca gcttcatgat tgtgttccga 2 940 
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gtgctgtgcg gagagtggat cgagtccatg tgggactgca tgtatgtggg cgatgtcagc 3000 
tgtataccct tcttcttggc cacggtcgtg atcggcaatt ttgtggttct taatcttttc 3060 
ttagctttgc ttttgtccaa cttcggttca tctagtttat cagccccgac tgccgacaat 3120 
gataccaata aaatagcaga ggccttcaat cgtattgctc gttttaagaa ctgggtgaaa 318 0 
cgtaatattg ccgattgttt taagttaatt cgaaataaat tgacaaatca aataagtgac 324 0 
caaccatcag aacatggcga taatgaactg gagttgggtc atgacgaaat catgggcgat 3300 
ggcttgatca aaaagggtat gaagggcgag acccagctgg aggtggccat tggcgatggc 33 6 0 
atggagttca cgatacatgg cgatatgaaa aacaacaagc ccaagaaatc aaaattcata 342 0 
aacaacacaa cgatgattgg aaactcaata aaccaccaag acaatagact ggaacatgag 34 8 0 
ctaaaccata gaggtttgtc catacaggac gatgacactg ccagcattaa ctcatatggt 3540 
agccataaga atcgaccatt caaggacgag agccacaagg gcagcgccga gaccatcgag 3600 
ggcgaggaga aacgcgacgt cagcaaagag gacctcggcc tcgacgagga actggacgag 36 6 0 
gaggccgagg gcgatgaggg ccagctggat ggtgacatca tcattcatgc ccaaaacgac 3720 
gacgagataa tcgacgacta tccggccgac tgtttccccg actcgtacta caagaagttt 3780 
ccgatcttgg ccggcgacga ggactcgccg ttctggcaag gatggggcaa tttacgactg 3840 
aaaacttttc aattaattga aaataaatat tttgaaaccg cagttatcac tatgatttta 3900 
atgagtagct tagctttggc cttagaagat gttcatttac ccgatcgacc tgtcatgcag 3960 
gatatactgt actacatgga caggatattt acggtgatat tctttttgga gatgttgatc 402 0 
aaatggttgg ccctgggctt taaggtctac ttcaccaatg cctggtgttg gctggatttc 4080 
gtgattgtca tgctatcgct tataaatttg gttgccgttt ggtcgggctt aaatgatata 414 0 
gccgtgttta gatcaatgcg cacactgcgc gccctaaggc cattgcgtgc tgtctctaga 4200 
tgggagggta tgaaagttgt cgtgaatgcg ctggttcaag ctataccgtc catcttcaat 42 6 0 
gtgctattgg tgtgtctgat attttggctt atttttgcca ttatgggagt acagcttttt 432 0 
gctggaaaat attttaagtg taaagatggt aatgacactg tgctgagcca tgaaatcata 43 8 0 
ccgaatcgta atgcctgcaa aagtgaaaac tacacctggg aaaattcggc aatgaacttc 4440 
gatcatgtag gtaatgcgta tctctgtcta tttcaagtgg ccacctttaa gggctggatc 45 00 
cagattatga acgatgccat tgattcacga gaggtggaca agcagccgat ccgagaaacc 4560 
aatatctaca tgtatttata tttcgtattc ttcattatat ttggatcatt tttcacactc 4620 
aatctgttca ttggtgttat cattgataat tttaatgaac aaaagaagaa agcaggtgga 4680 
tcattagaaa tgttcatgac agaagatcag aaaaagtact ataatgctat gaaaaagatg 4740 
ggctctaaaa aaccattaaa agccattcca agaccgaggt ggcgaccaca agcaatagta 4800 
ttcgaaatag ttacagataa aaaattcgat ataatcatta tgttgttcat tggcttaaac 4860 
atgtttacca tgaccctcga tcggtacgac gcctccgagg cgtacaacaa tgtcctcgac 4920 
aaactcaatg ggatattcgt agttattttc agtggcgaat gtctattaaa aatattcgct 4980 
ttacgatatc actatttcaa agagccatgg aatttatttg atgtagtagt tgtcatttta 5040 
tccatcttag gtcttgtact cagcgacatc attgagaagt atttcgtatc gccgacactg 5100 
ctccgtgtgg tgagagtggc caaagtgggt cgtgtcctgc gtttagtcaa gggtgccaag 5160 
ggtatccgga cgttgctgtt cgcgttagcc atgtcgttgc ctgccttatt caacatttgt 5220 
ctgttgctgt tcttggtgat gttcatcttt gctatctttg gcatgtcctt cttcatgcat 5280 
gtcaaagaga agagcggcat aaatgctgtg tataatttta agacatttgg ccaaagtatg 5340 
atattgctgt ttcagatgtc tacctcagcc ggttgggatg gtgtgttaga tgccattatc 5400 
aatgaggaag attgcgatcc acccgacaac gacaagggct atccgggcaa ttgtggttca 5460 
gcgactgttg gaattacgtt tctcctttca tatctagtta taagcttttt gatagttatt 5520 
aatatgtaca ttgctgtcat tctcgagaac tatagccagg ctacggagga tgtacaggag 5580 
ggtctcaccg acgacgacta tgatatgtac tacgagattt ggcaacaatt cgatccggag 5640 
ggtacccagt acataagata cgaccagctg tccgagttcc tggacgtgct ggagccgccg 5700 
ctgcagatcc acaagccgaa caagtacaaa atcatatcga tggacatgcc gatatgtcgg 5760 
ggcgacatga tgtactgtgt ggatatattg gatgccctga ccaaggactt ctttgcgcgc 5820 
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aagggtaatc cgatcgagga gacgggtgaa attggtgaga ttgcggcgcg accggacacc 58 8 0 

gagggctatg atccggtgtc gtcgacactg tggcgccagc gtgaggagta ctgcgccaag 594 0 

ctgatacaga atgcgtggcg gcgttacaag aatggcccac cccaggaggg tgatgagggc 6000 

gaggcggctg gtggcgaaga tggtgctgaa ggcggtgagg gtgaaggcgg cagcggcggc 6060 

ggcggcgatg atgatggtgg ctcagcgacg gcggcgggag ccacatcacc cacagatcca 6120 

gatgccggcg aagcagatgg tgccagcgcc ggcaatggtg gcggccccct tagtccgggc 6180 

tgtgttagtg gcggcagtaa tggccgccaa acggccgtac tggtcgaaag cgatggtttt 6240 

gttacaaaaa acggtcataa ggttgtaata cactcgagat cgccgagcat aacatccagg 63 00 

acggcagatg tctga 6315 

<210> 3 
<211> 2105 
<212> PRT 

<213> Musca domestica 
<400> 3 

Met Thr Glu Asp Ser Asp Ser lie Ser Glu Glu Glu Arg Ser Leu Phe 



Arg Pro Phe Thr Arg Glu Ser Leu Leu Gin lie Glu Gin Arg lie Ala 



Glu His Glu Lys Gin Lys Glu Leu Glu Arg Lys Arg Ala Ala Glu Gly 



Glu Gin lie Arg Tyr Asp Asp Glu Asp Glu Asp Glu Gly Pro Gin Pr 



Asp Pro Thr Leu Glu Gin Gly Val Pro lie Pro Val Arg Met Gin Gly 



Ser Phe Pro Pro Glu Leu Ala Ser Thr Pro Leu Glu Asp lie Asp Pro 



Phe Tyr Ser Asn Val Leu Thr Phe Val Val lie Ser Lys Gly Lys Asp 
100 105 110 

lie Phe Arg Phe Ser Ala Ser Lys Ala Met Trp Leu Leu Asp Pro Phe 
115 120 125 

Asn Pro lie Arg Arg Val Ala lie Tyr lie Leu Val His Pro Leu Phe 
130 135 140 

Ser Leu Phe lie lie Thr Thr He Leu Thr Asn Cys He Leu Met He 
145 150 155 160 

Met Pro Thr Thr Pro Thr Val Glu Ser Thr Glu Val lie Phe Thr Gly 
165 170 175 
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lie Tyr Thr Phe Glu 
180 

Leu Cys Pro Phe Thr 
195 

Val Val lie Ala Leu 
210 

Leu Ala Ala Leu Arg 
225 

Ala lie Val Pro Gly 
245 

Val Lys Asn Leu Arg 
260 

Val Phe Ala Leu Met 
275 

Lys Cys lie Lys Arg 
290 

Asp Glu Asn Trp Phe 
305 

Glu Asn Asp Gly Glu 
325 

Gly Gin Cys Gly Glu 
340 

Pro Asn Tyr Asp Tyr 
355 

Ser Ala Phe Arg Leu 
370 

His Val Leu Gin Ala 
385 

lie He Phe Leu Gly 
405 



Ser Ala Val Lys Val Met 
185 

Tyr Leu Arg Asp Ala Trp 
200 

Ala Tyr Val Thr Met Gly 
215 

Thr Phe Arg Val Leu Arg 
230 235 

Leu Lys Thr He Val Gly 
250 

Asp Val lie He Leu Thr 
265 

Gly Leu Gin He Tyr Met 
280 

Phe Pro Leu Asp Gly Ser 
295 

Leu His Asn Ser Asn Ser 
310 315 

Ser Tyr Pro Val Cys Gly 
330 

Asp Tyr Val Cys Leu Gin 
345 

Thr Ser Phe Asp Ser Phe 
360 

Met Thr Gin Asp Phe Trp 
375 

Ala Gly Pro Trp His Met 
390 395 

Ser Phe Tyr Leu Val Asn 
410 



Ala Arg Gly Phe He 
190 

Asn Trp Leu Asp Phe 
205 

He Asp Leu Gly Asn 
220 

Ala Leu Lys Thr Val 
240 

Ala Val He Glu Ser 
255 

Met Phe Ser Leu Ser 
270 

Gly Val Leu Thr Gin 
285 

Trp Gly Asn Leu Thr 
300 

Ser Asn Trp Phe Thr 
320 

Asn Val Ser Gly Ala 
335 

Gly Phe Gly Pro Asn 
350 

Gly Trp Ala Phe Leu 
365 

Glu Asp Leu Tyr Gin 
380 

Leu Phe Phe He Val 
400 

Leu He Leu Ala He 
415 



Val Ala Met Ser Tyr Asp Glu Leu Gin Lys Lys Ala Glu Glu Glu Glu 
420 425 430 



Ala Ala Glu Glu Glu 
435 

Lys Ala Ala Lys Leu 
450 

Gin Asp Ala Ala Asp 
465 

Lys Ser Pro Thr Tyr 
485 

Glu Lys Gly Asn Asp 
500 

Val Glu Val Glu Ser 

515 

Pro Thr Thr Ala Pro 
530 

Leu Ser Leu Pro Gly 
545 

Ser His Lys Tyr Thr 
565 

Gly Ser Asp Arg Lys 
580 

Gin His Leu Pro Tyr 
595 

Glu Glu Asn Gly Ala 
610 

Ser Arg His Ser Ser 
625 

Ser His Gly Asp Leu 
645 

Met Thr Lys Glu Ser 
660 



Ala lie Arg Glu Ala Glu 
440 

Glu Glu Arg Ala Asn Val 
455 

Ala Ala Ala Ala Ala Leu 
470 475 

Ser Cys lie Ser Tyr Glu 
490 

Asp Asn Asn Lys Glu Lys 
505 

Glu Ser Val Ser Val lie 
520 

Ala Thr Lys Val Arg Lys 
535 

Ser Pro Phe Asn Leu Arg 
550 555 

lie Arg Asn Gly Arg Gly 
570 

Pro Leu Val Leu Gin Thr 
585 

Ala Asp Asp Ser Asn Ala 
600 

lie lie Val Pro Ala Tyr 
615 

Tyr Thr Ser His Gin Ser 
630 635 

Leu Gly Gly Met Ala Ala 
650 

Lys Leu Arg Ser Arg Asn 
665 



Glu Ala Ala Ala Ala 
445 

Ala Ala Gin Ala Ala 
460 

His Pro Glu Met Ala 
480 

Leu Phe Val Gly Gly 
495 

Met Ser lie Arg Ser 
510 

Gin Arg Gin Pro Ala 
525 

Val Ser Thr Thr Ser 
540 

Arg Gly Ser Arg Ser 
560 

Arg Phe Gly lie Pro 
575 

Tyr Gin Asp Ala Gin 
590 

Val Thr Pro Met Ser 
605 

Tyr Cys Asn Leu Gly 
620 

Arg lie Ser Tyr Thr 
640 

Met Gly Ala Ser Thr 
655 

Thr Arg Asn Gin Ser 
670 



lie Gly Ala Ala Thr Asn Gly Gly Ser Ser Thr Ala Gly Gly Gly Tyr 
675 680 685 



Pro Asp Ala Asn His 
690 

Tyr Thr Asp Glu Ala 
705 

Glu Pro Val Gin Thr 
725 

Leu Asn Asp lie lie 
740 

Glu Arg Gly Glu Asp 
755 

Ala Leu Glu Tyr lie 
770 

Cys Cys Trp Val Trp 
785 

Phe Asp Pro Phe Val 
805 

Thr Met Phe Met Ala 
820 

Lys Val Leu Lys Ser 
835 

Glu Ala Ser Met Lys 
850 

Glu Gly Trp Asn lie 
865 

Glu Leu Gly Leu Glu 
885 

Arg Leu Leu Arg Val 
900 

Leu Leu lie Ser lie 
915 

Thr Phe Val Leu Cys 
930 



Lys Glu Gin Arg Asp Tyr 
695 

Gly Lys lie Lys His His 

710 715 

Gin Thr Val Val Asp Met 
730 

Glu Gin Ala Ala Gly Arg 
745 

Asp Asp Glu Asp Gly Pro 
760 

Leu Lys Gly lie Glu lie 
775 

Leu Lys Phe Gin Glu Trp 
790 795 

Glu Leu Phe lie Thr Leu 
810 

Met Asp His His Asp Met 
825 

Gly Asn Tyr Phe Phe Thr 
840 

Leu Met Ala Met Ser Pro 
855 

Phe Asp Phe lie lie Val 
870 875 

Gly Val Gin Gly Leu Ser 
890 

Phe Lys Leu Ala Lys Ser 
905 

Met Gly Arg Thr Met Gly 
920 

lie lie lie Phe lie Phe 
935 



Glu Met Gly Gin Asp 
700 

Asp Asn Pro Phe lie 
720 

Lys Asp Val Met Val 
735 

His Ser Arg Ala Ser 
750 

Thr Phe Lys Asp lie 
765 

Phe Cys Val Trp Asp 
780 

Val Ser Phe lie Val 
800 

Cys lie Val Val Asn 
815 

Asn Pro Glu Leu Glu 
830 

Ala Thr Phe Ala lie 
845 

Lys Tyr Tyr Phe Gin 
860 

Ala Leu Ser Leu Leu 
880 

Val Leu Arg Ser Phe 
895 

Trp Pro Thr Leu Asn 
910 

Ala Leu Gly Asn Leu 
925 

Ala Val Met Gly Met 
940 



Gin Leu Phe Gly Lys Asn Tyr lie Asp His Lys Asp Arg Phe Lys Asp 
945 950 955 960 



His Glu Leu Pro Arg Trp Asn Phe Thr Asp Phe Met His Ser Phe Met 
965 970 975 

lie Val Phe Arg Val Leu Cys Gly Glu Trp lie Glu Ser Met Trp Asp 
980 985 990 

Cys Met Tyr Val Gly Asp Val Ser Cys lie Pro Phe Phe Leu Ala Thr 
995 1000 1005 

Val Val lie Gly Asn Leu Val Val Leu Asn Leu Phe Leu Ala Leu Leu 
1010 1015 1020 

Leu Ser Asn Phe Gly Ser Ser Ser Leu Ser Ala Pro Thr Ala Asp Asn 
1025 1030 1035 1040 

Asp Thr Asn Lys lie Ala Glu Ala Phe Asn Arg lie Ala Arg Phe Lys 
1045 1050 1055 

Asn Trp Val Lys Arg Asn lie Ala Asp Cys Phe Lys Leu lie Arg Asn 
1060 1065 1070 

Lys Leu Thr Asn Gin lie Ser Asp Gin Pro Ser Glu His Gly Asp Asn 
1075 1080 1085 

Glu Leu Glu Leu Gly His Asp Glu lie Met Gly Asp Gly Leu lie Lys 
1090 1095 1100 

Lys Gly Met Lys Gly Glu Thr Gin Leu Glu Val Ala lie Gly Asp Gly 
1105 1110 1115 1120 

Met Glu Phe Thr lie His Gly Asp Met Lys Asn Asn Lys Pro Lys Lys 
1125 1130 1135 

Ser Lys Phe Met Asn Asn Thr Thr Met lie Gly Asn Ser He Asn His 
1140 1145 1150 

Gin Asp Asn Arg Leu Glu His Glu Leu Asn His Arg Gly Leu Ser He 
1155 1160 1165 

Gin Asp Asp Asp Thr Ala Ser lie Asn Ser Tyr Gly Ser His Lys Asn 
1170 1175 1180 

Arg Pro Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr He Glu 
1185 1190 1195 1200 
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Gly Glu Glu Lys Arg Asp Val Ser Lys Glu Asp Leu Gly Leu Asp Glu 
1205 1210 1215 



Glu Leu Asp Glu Glu Ala Glu Gly Asp Glu Gly Gin Leu Asp Gly Asp 
1220 1225 1230 

lie lie lie His Ala Gin Asn Asp Asp Glu lie lie Asp Asp Tyr Pro 
1235 1240 1245 

Ala Asp Cys Phe Pro Asp Ser Tyr Tyr Lys Lys Phe Pro lie Leu Ala 
1250 1255 1260 

Gly Asp Glu Asp Ser Pro Phe Trp Gin Gly Trp Gly Asn Leu Arg Leu 
1265 1270 1275 1280 

Lys Thr Phe Gin Leu lie Glu Asn Lys Tyr Phe Glu Thr Ala Val lie 
1285 1290 1295 

Thr Met lie Leu Met Ser Ser Leu Ala Leu Ala Leu Glu Asp Val His 
1300 1305 1310 

Leu Pro Asp Arg Pro Val Met Gin Asp lie Leu Tyr Tyr Met Asp Arg 
1315 1320 1325 

lie Phe Thr Val lie Phe Phe Leu Glu Met Leu lie Lys Trp Leu Ala 
1330 1335 1340 

Leu Gly Phe Lys Val Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp Phe 
1345 1350 1355 1360 

Val lie Val Met Leu Ser Leu lie Asn Leu Val Ala Val Trp Ser Gly 
1365 1370 1375 

Leu Asn Asp lie Ala Val Phe Arg Ser Met Arg Thr Leu Arg Ala Leu 
1380 1385 1390 

Arg Pro Leu Arg Ala Val Ser Arg Trp Glu Gly Met Lys Val Val Val 
1395 1400 1405 

Asn Ala Leu Val Gin Ala He Pro Ser He Phe Asn Val Leu Leu Val 
1410 1415 1420 

Cys Leu lie Phe Trp Leu He Phe Ala He Met Gly Val Gin Leu Phe 
1425 1430 1435 1440 

Ala Gly Lys Tyr Phe Lys Cys Lys Asp Gly Asn Asp Thr Val Leu Ser 
1445 1450 1455 
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His Glu lie lie Pro Asn Arg Asn Ala Cys Lys Ser Glu Asn Tyr Thr 
1460 1465 1470 



Trp Glu Asn Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu 
1475 1480 1485 

Cys Leu Phe Gin Val Ala Thr Phe Lys Gly Trp lie Gin lie Met Asn 
1490 1495 1500 

Asp Ala lie Asp Ser Arg Glu Val Asp Lys Gin Pro lie Arg Glu Thr 
1505 1510 1515 1520 

Asn lie Tyr Met Tyr Leu Tyr Phe Val Phe Phe lie lie Phe Gly Ser 
1525 1530 1535 

Phe Phe Thr Leu Asn Leu Phe lie Gly Val He He Asp Asn Phe Asn 
1540 1545 1550 

Glu Gin Lys Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu 
1555 1560 1565 

Asp Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys Met Gly Ser Lys Lys 
1570 1575 1580 

Pro Leu Lys Ala He Pro Arg Pro Arg Trp Arg Pro Gin Ala He Val 
1585 1590 1595 1600 

Phe Glu He Val Thr Asp Lys Lys Phe Asp He He He Met Leu Phe 
1605 1610 1615 

He Gly Leu Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser 
1620 1625 1630 

Glu Ala Tyr Asn Asn Val Leu Asp Lys Leu Asn Gly He Phe Val Val 
1635 1640 1645 

He Phe Ser Gly Glu Cys Leu Leu Lys He Phe Ala Leu Arg Tyr His 
1650 1655 1660 

Tyr Phe Lys Glu Pro Trp Asn Leu Phe Asp Val Val Val Val He Leu 
1665 1670 1675 1680 

Ser He Leu Gly Leu Val Leu Ser Asp He He Glu Lys Tyr Phe Val 
1685 1690 1695 

Ser Pro Thr Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val 
1700 1705 1710 
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Leu Arg Leu Val Lys Gly Ala Lys Gly lie Arg Thr Leu Leu Phe Ala 
1715 1720 1725 



Leu Ala Met Ser Leu Pro Ala Leu Phe Asn lie Cys Leu Leu Leu Phe 
1730 1735 1740 

Leu Val Met Phe lie Phe Ala lie Phe Gly Met Ser Phe Phe Met His 
1745 1750 1755 1760 

Val Lys Glu Lys Ser Gly lie Asn Ala Val Tyr Asn Phe Lys Thr Phe 
1765 1770 1775 

Gly Gin Ser Met lie Leu Leu Phe Gin Met Ser Thr Ser Ala Gly Trp 
1780 1785 1790 

Asp Gly Val Leu Asp Ala lie lie Asn Glu Glu Asp Cys Asp Pro Pro 
1795 1800 1805 

Asp Asn Asp Lys Gly Tyr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly 
1810 1815 1820 

lie Thr Phe Leu Leu Ser Tyr Leu Val lie Ser Phe Leu lie Val lie 
1825 1830 1835 1840 

Asn Met Tyr lie Ala Val lie Leu Glu Asn Tyr Ser Gin Ala Thr Glu 
1845 1850 1855 

Asp Val Gin Glu Gly Leu Thr Asp Asp Asp Tyr Asp Met Tyr Tyr Glu 
1860 1865 1870 

lie Trp Gin Gin Phe Asp Pro Glu Gly Thr Gin Tyr lie Arg Tyr Asp 
1875 1880 1885 

Gin Leu Ser Glu Phe Leu Asp Val Leu Glu Pro Pro Leu Gin lie His 
1890 1895 1900 

Lys Pro Asn Lys Tyr Lys lie lie Ser Met Asp Met Pro lie Cys Arg 
1905 1910 1915 1920 

Gly Asp Met Met Tyr Cys Val Asp lie Leu Asp Ala Leu Thr Lys Asp 
1925 1930 1935 

Phe Phe Ala Arg Lys Gly Asn Pro lie Glu Glu Thr Gly Glu lie Gly 
1940 1945 1950 

Glu lie Ala Ala Arg Pro Asp Thr Glu Gly Tyr Asp Pro Val Ser Ser 
1955 1960 1965 
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Thr Leu Trp Arg Gin Arg Glu Glu Tyr Cys Ala Lys Leu lie Gin Asn 
1970 1975 1980 



Ala Trp Arg Arg Tyr Lys Asn Gly Pro Pro Gin Glu Gly Asp Glu Gly 
1985 1990 1995 2000 

Glu Ala Ala Gly Gly Glu Asp Gly Ala Glu Gly Gly Glu Gly Glu Gly 
2005 2010 2015 

Gly Ser Gly Gly Gly Gly Gly Asp Asp Gly Gly Ser Ala Thr Gly Ala 
2020 2025 2030 

Thr Ala Ala Ala Gly Ala Thr Ser Pro Ser Asp Pro Asp Ala Gly Glu 
2035 2040 2045 

Ala Asp Gly Ala Ser Val Gly Gly Pro Leu Ser Pro Gly Cys Val Ser 
2050 2055 20S0 

Gly Gly Ser Asn Gly Arg Gin Thr Ala Val Leu Val Glu Ser Asp Gly 
2065 2070 2075 2080 

Phe Val Thr Lys Asn Gly His Lys Val Val lie His Ser Arg Ser Pro 
2085 2090 2095 

Ser lie Thr Ser Arg Thr Ala Asp Val 
2100 2105 



<210> 4 
<211> 2104 
<212> PRT 

<213> Musca domestica 



<400> 4 

Met Thr Glu Asp Ser Asp Ser lie Ser Glu Glu Glu Arg Ser Leu Phe 



Arg Pro Phe Thr Arg Glu Ser Leu 
20 

Glu His Glu Lys Gin Lys Glu Leu 
35 40 

Glu Gin lie Arg Tyr Asp Asp Glu 
50 55 

Asp Pro Thr Leu Glu Gin Gly Val 



Leu Gin He Glu Gin Arg lie Ala 
25 30 

Glu Arg Lys Arg Ala Ala Glu Gly 
45 

Asp Glu Asp Glu Gly Pro Gin Pro 
60 

Pro lie Pro Val Arg Met Gin Gly 
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Ser Phe Pro Pro Glu Leu Ala Ser Thr Pro Leu Glu Asp lie Asp Pro 



Phe Tyr Ser Asn Val Leu Thr Phe Val Val lie Ser Lys Gly Lys Asp 
100 105 110 

lie Phe Arg Phe Ser Ala Ser Lys Ala Met Trp Leu Leu Asp Pro Phe 
115 120 125 

Asn Pro lie Arg Arg Val Ala lie Tyr lie Leu Val His Pro Leu Phe 
130 135 140 

Ser Leu Phe lie lie Thr Thr lie Leu Thr Asn Cys lie Leu Met lie 
145 150 155 160 

Met Pro Thr Thr Pro Thr Val Glu Ser Thr Glu Val lie Phe Thr Gly 
165 170 175 

lie Tyr Thr Phe Glu Ser Ala Val Lys Val Met Ala Arg Gly Phe lie 
180 185 190 

Leu Cys Pro Phe Thr Tyr Leu Arg Asp Ala Trp Asn Trp Leu Asp Phe 
195 200 205 

Val Val lie Ala Leu Ala Tyr Val Thr Met Gly lie Asp Leu Gly Asn 
210 215 220 

Leu Ala Ala Leu Arg Thr Phe Arg Val Leu Arg Ala Leu Lys Thr Val 
225 230 235 240 

Ala He Val Pro Gly Leu Lys Thr He Val Gly Ala Val He Glu Ser 
245 250 255 

Val Lys Asn Leu Arg Asp Val lie He Leu Thr Met Phe Ser Leu Ser 
260 265 270 

Val Phe Ala Leu Met Gly Leu Gin He Tyr Met Gly Val Leu Thr Gin 
275 280 285 

Lys Cys He Lys Arg Phe Pro Leu Asp Gly Ser Trp Gly Asn Leu Thr 
290 295 300 

Asp Glu Asn Trp Phe Leu His Asn Ser Asn Ser Ser Asn Trp Phe Thr 
305 310 315 320 

Glu Asn Asp Gly Glu Ser Tyr Pro Val Cys Gly Asn Val Ser Gly Ala 
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325 



330 



335 



Gly Gin Cys Gly 
340 

Pro Asn Tyr Asp 
355 

Ser Ala Phe Arg 
370 

His Val Leu Gin 
385 

lie lie Phe Leu 



Val Ala Met Ser 
420 

Ala Ala Glu Glu 
435 

Lys Ala Ala Lys 
450 

Gin Asp Ala Ala 
465 

Lys Ser Pro Thr 



Glu Lys Gly Asn 
500 

Val Glu Val Glu 
515 

Pro Thr Thr Ala 
530 

Leu Ser Leu Pro 
545 

Ser His Lys Tyr 



Gly Ser Asp Arg 



Glu Asp Tyr Val 



Tyr Thr Ser Phe 
360 

Leu Met Thr Gin 
375 

Ala Ala Gly Pro 
390 

Gly Ser Phe Tyr 
405 

Tyr Asp Glu Leu 



Glu Ala lie Arg 
440 

Leu Glu Glu Arg 
455 

Asp Ala Ala Ala 
470 

Tyr Ser Cys lie 
485 

Asp Asp Asn Asn 



Ser Glu Ser Val 
520 

Pro Ala Thr Lys 
535 

Gly Ser Pro Phe 
550 

Thr lie Arg Asn 
565 

Lys Pro Leu Val 



Cys Leu Gin Gly 
345 

Asp Ser Phe Gly 



Asp Phe Trp Glu 
380 

Trp His Met Leu 
395 

Leu Val Asn Leu 
410 

Gin Lys Lys Ala 
425 

Glu Ala Glu Glu 



Ala Asn Val Ala 
460 

Ala Ala Leu His 
475 

Ser Tyr Glu Leu 
490 

Lys Glu Lys Met 
505 

Ser Val lie Gin 



Val Arg Lys Val 
540 

Asn Leu Arg Arg 
555 

Gly Arg Gly Arg 
570 

Leu Gin Thr Tyr 



Phe Gly Pro Asn 
350 

Trp Ala Phe Leu 
365 

Asp Leu Tyr Gin 



Phe Phe lie Val 
400 

lie Leu Ala lie 
415 

Glu Glu Glu Glu 
430 

Ala Ala Ala Ala 
445 

Ala Gin Ala Ala 



Pro Glu Met Ala 
480 

Phe Val Gly Gly 
495 

Ser lie Arg Ser 
510 

Arg Gin Pro Ala 
525 

Ser Thr Thr Ser 



Gly Ser Arg Ser 
560 

Phe Gly lie Pro 
575 

Gin Asp Ala Gin 
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580 



585 



590 



Gin His Leu Pro 
595 

Glu Glu Asn Gly 
610 

Ser Arg His Ser 
625 

Ser His Gly Asp 



Met Thr Lys Glu 
660 

lie Gly Ala Ala 
675 

Pro Asp Ala Asn 
690 

Tyr Thr Asp Glu 
705 

Glu Pro Val Gin 



Leu Asn Asp lie 
740 

Glu Arg Gly Glu 
755 

Ala Leu Glu Tyr 
770 

Cys Cys Trp Val 
785 

Phe Asp Pro Phe 



Thr Met Phe Met 
820 

Lys Val Leu Lys 



Tyr Ala Asp Asp 
600 

Ala lie He Val 
615 

Ser Tyr Thr Ser 
630 

Leu Leu Gly Gly 
645 

Ser Lys Leu Arg 



Thr Asn Gly Gly 
680 

His Lys Glu Gin 
695 

Ala Gly Lys He 
710 

Thr Gin Thr Val 
725 

He Glu Gin Ala 



Asp Asp Asp Glu 
760 

lie Leu Lys Gly 
775 

Trp Leu Lys Phe 
790 

Val Glu Leu Phe 
805 

Ala Met Asp His 



Ser Gly Asn Tyr 



Ser Asn Ala Val 



Pro Ala Tyr Tyr 
620 

His Gin Ser Arg 
635 

Met Ala Ala Met 
650 

Ser Arg Asn Thr 
665 

Ser Ser Thr Ala 



Arg Asp Tyr Glu 
700 

Lys His His Asp 
715 

Val Asp Met Lys 
730 

Ala Gly Arg His 
745 

Asp Gly Pro Thr 



lie Glu He Phe 
780 

Gin Glu Trp Val 

795 

He Thr Leu Cys 
810 

His Asp Met Asn 
825 

Phe Phe Thr Ala 



Thr Pro Met Ser 
605 

Cys Asn Leu Gly 



He Ser Tyr Thr 
640 

Gly Ala Ser Thr 
655 

Arg Asn Gin Ser 
670 

Gly Gly Gly Tyr 
685 

Met Gly Gin Asp 



Asn Pro Phe He 
720 

Asp Val Met Val 
735 

Ser Arg Ala Ser 
750 

Phe Lys Asp He 
765 

Cys Val Trp Asp 



Ser Phe He Val 

800 

He Val Val Asn 
815 

Pro Glu Leu Glu 
830 

Thr Phe Ala He 
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835 



840 



845 



Glu Ala Ser Met Lys Leu Met Ala Met Ser Pro Lys Tyr Tyr Phe Gin 
850 855 860 

Glu Gly Trp Asn lie Phe Asp Phe lie lie Val Ala Leu Ser Leu Leu 
865 870 875 880 

Glu Leu Gly Leu Glu Gly Val Gin Gly Leu Ser Val Leu Arg Ser Phe 
885 890 895 

Arg Leu Leu Arg Val Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn 
900 905 910 

Leu Leu lie Ser lie Met Gly Arg Thr Met Gly Ala Leu Gly Asn Leu 
915 920 925 

Thr Phe Val Leu Cys lie lie lie Phe lie Phe Ala Val Met Gly Met 
930 935 940 

Gin Leu Phe Gly Lys Asn Tyr lie Asp His Lys Asp Arg Phe Lys Asp 
945 950 955 960 

His Glu Leu Pro Arg Trp Asn Phe Thr Asp Phe Met His Ser Phe Met 
965 970 975 

lie Val Phe Arg Val Leu Cys Gly Glu Trp lie Glu Ser Met Trp Asp 
980 985 990 

Cys Met Tyr Val Gly Asp Val Ser Cys lie Pro Phe Phe Leu Ala Thr 
995 1000 1005 

Val Val lie Gly Asn Phe Val Val Leu Asn Leu Phe Leu Ala Leu Leu 
1010 1015 1020 

Leu Ser Asn Phe Gly Ser Ser Ser Leu Ser Ala Pro Thr Ala Asp Asn 
1025 1030 1035 1040 

Asp Thr Asn Lys lie Ala Glu Ala Phe Asn Arg lie Ala Arg Phe Lys 
1045 1050 1055 

Asn Trp Val Lys Arg Asn lie Ala Asp Cys Phe Lys Leu lie Arg Asn 
1060 1065 1070 

Lys Leu Thr Asn Gin lie Ser Asp Gin Pro Ser Glu His Gly Asp Asn 
1075 1080 1085 

Glu Leu Glu Leu Gly His Asp Glu lie Met Gly Asp Gly Leu lie Lys 
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Lys Gly Met Lys Gly Glu Thr Gin Leu Glu Val Ala He Gly Asp Gly 
1105 1110 1115 1120 

Met Glu Phe Thr He His Gly Asp Met Lys Asn Asn Lys Pro Lys Lys 
1125 1130 1135 

Ser Lys Phe He Asn Asn Thr Thr Met He Gly Asn Ser He Asn His 
1140 1145 1150 

Gin Asp Asn Arg Leu Glu His Glu Leu Asn His Arg Gly Leu Ser He 
1155 1160 1165 

Gin Asp Asp Asp Thr Ala Ser He Asn Ser Tyr Gly Ser His Lys Asn 
1170 1175 1180 

Arg Pro Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr He Glu 
1185 1190 1195 1200 

Gly Glu Glu Lys Arg Asp Val Ser Lys Glu Asp Leu Gly Leu Asp Glu 
1205 1210 1215 

Glu Leu Asp Glu Glu Ala Glu Gly Asp Glu Gly Gin Leu Asp Gly Asp 
1220 1225 1230 

He He lie His Ala Gin Asn Asp Asp Glu He He Asp Asp Tyr Pro 
1235 1240 1245 

Ala Asp Cys Phe Pro Asp Ser Tyr Tyr Lys Lys Phe Pro He Leu Ala 
1250 1255 1260 

Gly Asp Glu Asp Ser Pro Phe Trp Gin Gly Trp Gly Asn Leu Arg Leu 
1265 1270 1275 1280 

Lys Thr Phe Gin Leu He Glu Asn Lys Tyr Phe Glu Thr Ala Val He 
1285 1290 1295 

Thr Met He Leu Met Ser Ser Leu Ala Leu Ala Leu Glu Asp Val His 
1300 1305 1310 

Leu Pro Asp Arg Pro Val Met Gin Asp He Leu Tyr Tyr Met Asp Arg 
1315 1320 1325 

He Phe Thr Val He Phe Phe Leu Glu Met Leu He Lys Trp Leu Ala 
1330 1335 1340 

Leu Gly Phe Lys Val Tyr Phe Thr Asn Ala Trp Cys Trp Leu Asp Phe 



1345 



1350 



1355 



1360 



Val lie Val Met Leu Ser Leu lie Asn Leu Val Ala Val Trp Ser Gly 
1365 1370 1375 

Leu Asn Asp lie Ala Val Phe Arg Ser Met Arg Thr Leu Arg Ala Leu 
1380 1385 1390 

Arg Pro Leu Arg Ala Val Ser Arg Trp Glu Gly Met Lys Val Val Val 
1395 1400 1405 

Asn Ala Leu Val Gin Ala lie Pro Ser lie Phe Asn Val Leu Leu Val 
1410 1415 1420 

Cys Leu lie Phe Trp Leu lie Phe Ala lie Met Gly Val Gin Leu Phe 
1425 1430 1435 1440 

Ala Gly Lys Tyr Phe Lys Cys Lys Asp Gly Asn Asp Thr Val Leu Ser 
1445 1450 1455 

His Glu lie lie Pro Asn Arg Asn Ala Cys Lys Ser Glu Asn Tyr Thr 
1460 1465 1470 

Trp Glu Asn Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu 
1475 1480 1485 

Cys Leu Phe Gin Val Ala Thr Phe Lys Gly Trp lie Gin lie Met Asn 
1490 1495 1500 

Asp Ala He Asp Ser Arg Glu Val Asp Lys Gin Pro He Arg Glu Thr 
1505 1510 1515 1520 

Asn He Tyr Met Tyr Leu Tyr Phe Val Phe Phe He He Phe Gly Ser 
1525 1530 1535 

Phe Phe Thr Leu Asn Leu Phe He Gly Val He He Asp Asn Phe Asn 
1540 1545 1550 

Glu Gin Lys Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu 
1555 1560 1565 

Asp Gin Lys Lys Tyr Tyr Asn Ala Met Lys Lys Met Gly Ser Lys Lys 
1570 1575 1580 

Pro Leu Lys Ala He Pro Arg Pro Arg Trp Arg Pro Gin Ala He Val 
1585 1590 1595 1600 

Phe Glu He Val Thr Asp Lys Lys Phe Asp He He He Met Leu Phe 
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1605 



1610 



1615 



He Gly Leu Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser 
1620 1625 1630 

Glu Ala Tyr Asn Asn Val Leu Asp Lys Leu Asn Gly He Phe Val Val 
1635 1640 1645 

He Phe Ser Gly Glu Cys Leu Leu Lys He Phe Ala Leu Arg Tyr His 
1650 1655 1660 

Tyr Phe Lys Glu Pro Trp Asn Leu Phe Asp Val Val Val Val He Leu 
1665 1670 1675 1680 

Ser He Leu Gly Leu Val Leu Ser Asp He He Glu Lys Tyr Phe Val 
1685 1690 1695 

Ser Pro Thr Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val 
1700 1705 1710 

Leu Arg Leu Val Lys Gly Ala Lys Gly He Arg Thr Leu Leu Phe Ala 
1715 1720 1725 

Leu Ala Met Ser Leu Pro Ala Leu Phe Asn He Cys Leu Leu Leu Phe 
1730 1735 1740 

Leu Val Met Phe He Phe Ala He Phe Gly Met Ser Phe Phe Met His 
1745 1750 1755 1760 

Val Lys Glu Lys Ser Gly He Asn Ala Val Tyr Asn Phe Lys Thr Phe 
1765 1770 1775 

Gly Gin Ser Met He Leu Leu Phe Gin Met Ser Thr Ser Ala Gly Trp 
1780 1785 1790 

Asp Gly Val Leu Asp Ala He He Asn Glu Glu Asp Cys Asp Pro Pro 
1795 1800 1805 

Asp Asn Asp Lys Gly Tyr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly 
1810 1815 1820 

He Thr Phe Leu Leu Ser Tyr Leu Val He Ser Phe Leu He Val He 
1825 1830 1835 1840 

Asn Met Tyr He Ala Val He Leu Glu Asn Tyr Ser Gin Ala Thr Glu 
1845 1850 1855 

Asp Val Gin Glu Gly Leu Thr Asp Asp Asp Tyr Asp Met Tyr Tyr Glu 



1860 



1865 



1870 



He Trp Gin Gin Phe Asp Pro Glu Gly Thr Gin Tyr He Arg Tyr Asp 
1875 1880 1885 

Gin Leu Ser Glu Phe Leu Asp Val Leu Glu Pro Pro Leu Gin He His 
1890 - 1895 1900 

Lys Pro Asn Lys Tyr Lys He He Ser Met Asp Met Pro He Cys Arg 
1905 1910 1915 1920 

Gly Asp Met Met Tyr Cys Val Asp He Leu Asp Ala Leu Thr Lys Asp 
1925 1930 1935 

Phe Phe Ala Arg Lys Gly Asn Pro He Glu Glu Thr Gly Glu He Gly 
1940 1945 1950 

Glu He Ala Ala Arg Pro Asp Thr Glu Gly Tyr Asp Pro Val Ser Ser 
1955 1960 1965 

Thr Leu Trp Arg Gin Arg Glu Glu Tyr Cys Ala Lys Leu He Gin Asn 
1970 1975 1980 

Ala Trp Arg Arg Tyr Lys Asn Gly Pro Pro Gin Glu Gly Asp Glu Gly 
1985 1990 1995 2000 

Glu Ala Ala Gly Gly Glu Asp Gly Ala Glu Gly Gly Glu Gly Glu Gly 
2005 2010 2015 

Gly Ser Gly Gly Gly Gly Asp Asp Asp Gly Gly Ser Ala Thr Ala Ala 
2020 2025 2030 

Gly Ala Thr Ser Pro Thr Asp Pro Asp Ala Gly Glu Ala Asp Gly Ala 
2035 2040 2045 

Ser Ala Gly Asn Gly Gly Gly Pro Leu Ser Pro Gly Cys Val Ser Gly 
2050 2055 2060 

Gly Ser Asn Gly Arg Gin Thr Ala Val Leu Val Glu Ser Asp Gly Phe 
2065 2070 2075 2080 

Val Thr Lys Asn Gly His Lys Val Val He His Ser Arg Ser Pro Ser 
2085 2090 2095 

He Thr Ser Arg Thr Ala Asp Val 
2100 
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<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Syntheti 
primers for PCR amplification of Vsscl cDNAs . 

<400> 5 

cggttgggct ttcctgtc 

<210> 6 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Syntheti 
primers for PCR amplification of Vsscl cDNAs . 

<220> 

<221> unsure 
<222> (21) 

<223> N at position 21 is either A, C, G, or T 
<400> 6 

gggaattcra adatrttcca nccytc 

<210> 7 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthet: 
primers for PCR amplification of Vsscl cDNAs 

<220> 

<221> unsure 
<222> (18) 

<223> N at position 18 is either A, C, G, or T 
<400> 7 

cccgargaya thgaycynta yta 

<210> 8 
<211> 18 
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<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs . 

<400> 8 

cgtatcgcct cctcctcg 

<210> 9 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs . 

<220> 

<221> unsure 
<222> (17) 

<223> N at any position in this sequence is A, C, G, or 
T 

<400> 9 

gggtctagat httygcnath ttyggnatg 

<210> 10 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs . 

<220> 

<221> unsure 
<222> (10) 

<223> N at position 10 is either A, C, G, or T 
<400> 10 

ggggaattcn ggrtcraayt gytgcca 

<210> 11 
<211> 27 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs . 

<220> 

<2 21> unsure 
<222> (13) 

<223> N at position 13 is either A, C, G, or T 
<400> 11 

gggtctagar gancaraara artayta 

<210> 12 
<2ll> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs . 

<400> 12 

tcatactttg gcccaatgtc 

<210> 13 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs . 

<400> 13 

cccgaattag agaaggtgct g 

<210> 14 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs . 

<400> 14 

25 



actattgctt gtggtcgcca c 
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<210> 15 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs . 

<220> 

<221> unsure 
<222> (5) 

<223> N at any position in this sequence is A, C, G, or 
T 

<400> 15 

catcnttrgc ngcntagacn atgac 

<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs . 

<400> 16 

gattgaatgg atcgagcagc c 

<210> 17 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs. 

<400> 17 

cgtttctcct ttcatatcta g 

<210> 18 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
primers for PCR amplification of Vsscl cDNAs . 

<220> 

<221> unsure 
<222> (11) 

<223> N at any position in this sequence is A, C, G, or 
T 



<400> 18 

ggagbggbgg nckbggnckn gctca 

<210> 19 
<211> 2100 
<212> PRT 

<213> Drosophila melanogaster 



<400> 19 

Met Thr Glu Asp Ser Asp Ser lie 
1 5 

Arg Pro Phe Thr Arg Glu Ser Leu 
20 

Ala Glu His Glu Lys Gin Lys Glu 

35 40 



Ser Glu Glu Glu Arg Ser Leu Phe 
10 15 

Val Gin lie Glu Gin Arg He Ala 
25 30 

Leu Glu Arg Lys Arg Ala Glu Gly 
45 



Glu Val Pro Arg Tyr Gly Arg Lys 
50 55 

Asp Asp Glu Asp Glu Asp Glu Gly 
65 70 

Gin Gly Val Pro He Pro Val Arg 
85 

Leu Ala Ser Thr Pro Leu Glu Asp 
100 



Lys Lys Gin Lys Glu He Arg Tyr 
60 

Pro Gin Pro Asp Pro Thr Leu Glu 

75 80 

Leu Gin Gly Ser Phe Pro Pro Glu 
90 95 

He Asp Pro Tyr Tyr Ser Asn Val 
105 HO 



Leu Thr Phe Val Val Val Ser Lys Gly Lys Asp He Phe Arg Phe Ser 
115 120 125 

Ala Ser Lys Ala Met Trp Met Leu Asp Pro Phe Asn Pro He Arg Arg 
130 135 140 

Val Ala He Tyr He Leu Val His Pro Leu Phe Ser Leu Phe He He 
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145 



150 



155 



160 



Thr Thr lie Leu 



Thr Val Glu Ser 
180 

Ser Ala Val Lys 
195 

Tyr Leu Arg Asp 
210 

Ala Tyr Val Thr 
225 

Thr Phe Arg Val 



Leu Lys Thr lie 
260 

Asp Val lie lie 
275 

Gly Leu Gin lie 
290 

Phe Pro Leu Asp 
305 

Tyr His Asn Arg 



Ser Phe Pro Leu 
340 

Asp Tyr Val Cys 
355 

Thr Ser Phe Asp 
370 

Met Thr Gin Asp 
385 

Ala Gly Pro Trp 



Val Asn Cys lie 
165 

Thr Glu Val He 



Val Met Ala Arg 
200 

Ala Trp Asn Trp 
215 

Met Gly He Asp 
230 

Leu Arg Ala Leu 
245 

Val Gly Ala Val 



Leu Thr Met Phe 
280 

Tyr Met Gly Val 
295 

Gly Ser Trp Gly 
310 

Asn Ser Ser Asn 
325 

Cys Gly Asn He 



Leu Gin Gly Phe 
360 

Ser Phe Gly Trp 
375 

Phe Trp Glu Asp 
390 

His Met Leu Phe 



Leu Met He Met 
170 

Phe Thr Gly He 
185 

Gly Phe He Leu 



Leu Asp Phe Val 
220 

Leu Gly Asn Leu 
235 

Lys Thr Val Ala 
250 

He Glu Ser Val 
265 

Ser Leu Ser Val 



Leu Thr Glu Lys 
300 

Asn Leu Thr Asp 
315 

Trp Tyr Ser Glu 
330 

Ser Gly Ala Gly 
345 

Gly Pro Asn Pro 



Ala Phe Leu Ser 
380 

Leu Tyr Gin Leu 
395 

Phe He Val He 



Pro Thr Thr Pro 
175 

Tyr Thr Phe Glu 
190 

Cys Pro Phe Thr 
205 

Val He Ala Leu 



Ala Ala Leu Arg 
240 

He Val Pro Gly 
255 

Lys Asn Leu Arg 
270 

Phe Ala Leu Met 
285 

Cys He Lys Lys 



Glu Asn Trp Asp 
320 

Asp Glu Gly He 
335 

Gin Cys Asp Asp 
350 

Asn Tyr Gly Tyr 
365 

Ala Phe Arg Leu 



Val Leu Arg Ala 
400 

He Phe Leu Gly 
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405 



410 



415 



Ser Phe Tyr Leu 
420 

Asp Glu Leu Gin 
435 

Ala lie Arg Glu 
450 

Glu Glu Arg Ala 
465 

Ala Glu Glu Ala 



Ser Cys lie Ser 
500 

Asp Asn Asn Lys 
515 

Glu Ser Val Ser 
530 

Gin Ala Thr Lys 
545 

Gly Ser Pro Phe 



Thr lie Arg Asn 
580 

Lys Pro Leu Val 
595 

Tyr Ala Asp Asp 
610 

Ala He He Val 
625 

Ser Tyr Thr Ser 



Leu Leu Gly Gly 



Val Asn Leu He 



Arg Lys Ala Glu 
440 

Ala Glu Glu Ala 
455 

Asn Ala Gin Ala 
470 

Ala Leu His Pro 
485 

Tyr Glu Leu Phe 



Glu Lys Met Ser 
520 

Val lie Gin Arg 
535 

Val Arg Lys Val 
550 

Asn He Arg Arg 
565 

Gly Arg Gly Arg 



Leu Ser Thr Tyr 
600 

Ser Asn Ala Val 
615 

Pro Val Tyr Tyr 
630 

His Gin Ser Arg 

645 

Met Ala Val Met 



Leu Ala He Val 
425 

Glu Glu Glu Ala 



Ala Ala Ala Lys 
460 

Gin Ala Ala Ala 
475 

Glu Met Ala Lys 
490 

Val Gly Gly Glu 
505 

He Arg Ser Val 



Gin Pro Ala Pro 
540 

Ser Thr Thr Ser 
555 

Gly Ser Arg Ser 
570 

Phe Gly He Pro 
585 

Gin Asp Ala Gin 



Thr Pro Met Ser 
620 

Gly Asn Leu Gly 
635 

He Ser Tyr Thr 
650 

Gly Val Ser Thr 



Ala Met Ser Tyr 
430 

Ala Glu Glu Glu 
445 

Ala Ala Lys Leu 



Asp Ala Ala Ala 
480 

Ser Pro Thr Tyr 
495 

Lys Gly Asn Asp 
510 

Glu Val Glu Ser 
525 

Thr Thr Ala His 



Leu Ser Leu Pro 
560 

Ser His Lys Tyr 
575 

Gly Ser Asp Arg 
590 

Gin His Leu Pro 
605 

Glu Glu Asn Gly 



Ser Arg His Ser 
640 

Ser His Gly Asp 
655 

Met Thr Lys Glu 
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660 



665 



670 



Ser Lys Leu Arg 
675 

Asn Gly Gly Thr 
690 

Asp Tyr Glu lie 
705 

His His Asp Asn 



Asp Met Lys Asp 
740 

Gly Arg His Ser 
755 

Gly Pro Thr Phe 
770 

Asp Val Phe Cys 
785 

Glu Trp Val Ser 



Thr Leu Cys lie 
820 

Asp Met Asn Lys 
835 

Phe Thr Ala Thr 
850 

Ser Pro Lys Tyr 
865 

lie Val Ala Leu 



Leu Ser Val Leu 
900 

Lys Ser Trp Pro 



Asn Arg Asn Thr 
680 

Thr Cys Leu Asp 
695 

Gly Leu Glu Cys 
710 

Pro Phe lie Glu 
725 

Val Met Val Leu 



Arg Ala Ser Asp 
760 

Lys Asp Lys Ala 
775 

Val Trp Asp Cys 
790 

Leu lie Val Phe 
805 

Val Val Asn Thr 



Glu Met Glu Arg 
840 

Phe Ala lie Glu 
855 

Tyr Phe Gin Glu 
870 

Ser Leu Leu Glu 
885 

Arg Ser Phe Arg 



Thr Leu Asn Leu 



Arg Asn Gin Ser 



Thr Asn His Lys 
700 

Thr Asp Glu Ala 
715 

Pro Val Gin Thr 
730 

Asn Asp lie lie 
745 

Arg Gly Glu Asp 



Leu Glu Val lie 
780 

Cys Trp Val Trp 
795 

Asp Pro Phe Val 
810 

Met Phe Met Ala 
825 

Val Leu Lys Ser 



Ala Thr Met Lys 
860 

Gly Trp Asn He 
875 

Leu Gly Leu Glu 
890 

Leu Leu Arg Val 
905 

Leu He Ser He 



Val Gly Ala Thr 
685 

Leu Asp His Arg 



Gly Lys He Lys 
720 

Gin Thr Val Val 
735 

Glu Gin Ala Ala 
750 

Asp Asp Glu Asp 
765 

Leu Lys Gly He 



Leu Lys Phe Gin 
800 

Glu Leu Phe He 
815 

Met Asp His His 
830 

Gly Asn Tyr Phe 
845 

Leu Met Ala Met 



Phe Asp Phe He 
880 

Gly Val Gin Gly 
895 

Phe Lys Leu Ala 
910 

Met Gly Arg Thr 
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915 



920 



925 



Met Gly Ala Leu Gly Asn Leu Thr Phe Val Leu Cys lie lie He Phe 
930 935 940 

He Phe Ala Val Met Gly Met Gin Leu Phe Gly Lys Asn Tyr His Asp 
945 950 955 960 

His Lys Asp Arg Phe Pro Asp Gly Asp Leu Pro Arg Trp Asn Phe Thr 
965 970 975 

Asp Phe Met His Ser Phe Met He Val Phe Arg Val Leu Cys Gly Glu 
980 985 990 

Trp He Glu Ser Met Trp Asp Cys Met Tyr Val Gly Asp Val Ser Cys 
995 1000 1005 

He Pro Phe Phe Leu Ala Thr Val Val He Gly Asn Leu Val Val Leu 
1010 1015 1020 

Asn Leu Phe Leu Ala Leu Leu Leu Ser Asn Phe Gly Ser Ser Ser Leu 
1025 1030 1035 1040 

Ser Ala Pro Thr Ala Asp Asn Asp Thr Asn Lys He Ala Glu Ala Phe 
1045 1050 1055 

Asn Arg He Gly Arg Phe Lys Ser Trp Val Lys Arg Asn He Ala Asp 
1060 1065 1070 

Cys Phe Lys Leu He Arg Asn Lys Leu Thr Asn Gin He Ser Asp Gin 
1075 1080 1085 

Pro Ser Glu His Gly Asp Asn Glu Leu Glu Leu Gly His Asp Glu He 
1090 1095 1100 

Leu Ala Asp Gly Leu He Lys Lys Gly He Lys Glu Gin Thr Gin Leu 
1105 1110 1115 1120 

Glu Val Ala He Gly Asp Gly Met Glu Phe Thr He His Gly Asp Met 
1125 1130 1135 

Lys Asn Asn Lys Pro Lys Lys Ser Lys Tyr Leu Asn Asn Ala Thr Asp 
1140 1145 1150 

Asp Asp Thr Ala Ser He Asn Ser Tyr Gly Ser His Lys Asn Arg Pro 
1155 1160 1165 

Phe Lys Asp Glu Ser His Lys Gly Ser Ala Glu Thr Met Glu Gly Glu 
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1170 



1175 



1180 



Glu Lys Arg Asp Ala Ser Lys Glu Asp Leu Gly Leu Asp Glu Glu Leu 
1185 1190 1195 1200 

Asp Glu Glu Gly Glu Cys Glu Glu Gly Pro Leu Asp Gly Asp He He 
1205 1210 1215 

He His Ala His Asp Glu Asp He Leu Asp Glu Tyr Pro Ala Asp Cys 
1220 1225 1230 

Cys Pro Asp Ser Tyr Tyr Lys Lys Phe Pro He Leu Ala Gly Asp Asp 
1235 1240 1245 

Asp Ser Pro Phe Trp Gin Gly Trp Gly Asn Leu Arg Leu Lys Thr Phe 
1250 1255 1260 

Arg Leu He Glu Asp Lys Tyr Phe Glu Thr Ala Val He Thr Met He 
1265 1270 1275 1280 

Leu Met Ser Ser Leu Ala Leu Ala Leu Glu Asp Val His Leu Pro Gin 
1285 1290 1295 

Arg Pro He Leu Gin Asp He Leu Tyr Tyr Met Asp Arg He Phe Thr 
1300 1305 1310 

Val He Phe Phe Leu Glu Met Leu He Lys Trp Leu Ala Leu Gly Phe 
1315 1320 1325 

Lys Val Tyr Leu Thr Asn Ala Trp Cys Trp Leu Asp Phe Val He Val 
1330 1335 1340 

Met Val Ser Leu He Asn Phe Val Ala Ser Leu Val Gly Ala Gly Gly 
1345 1350 1355 1360 

He Gin Ala Phe Lys Thr Met Arg Thr Leu Arg Ala Leu Arg Pro Leu 
1365 1370 1375 

Arg Ala Met Ser Arg Met Gin Gly Met Arg Val Val Val Asn Ala Leu 
1380 1385 1390 

Val Gin Ala He Pro Ser He Phe Asn Val Leu Leu Val Cys Leu He 
1395 1400 1405 

Phe Trp Leu He Phe Ala He Met Gly Val Gin Leu Phe Ala Gly Lys 
1410 1415 1420 

Tyr Phe Lys Cys Glu Asp Met Asn Gly Thr Lys Leu Ser His Glu He 
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1425 



1430 



1435 



1440 



He Pro Asn Arg Asn Ala Cys Glu Ser Glu Asn Tyr Thr Trp Val Asn 
1445 1450 1455 

Ser Ala Met Asn Phe Asp His Val Gly Asn Ala Tyr Leu Cys Leu Phe 
1460 1465 1470 

Gin Val Ala Thr Phe Lys Gly Trp He Gin He Met Asn Asp Ala He 
1475 1480 1485 

Asp Ser Arg Glu Val Asp Lys Gin Pro He Arg Glu Thr Asn He Tyr 
1490 1495 1500 

Met Tyr Leu Tyr Phe Val Phe Phe He He Phe Gly Ser Phe Phe Thr 
1505 1510 1515 1520 

Leu Asn Leu Phe He Gly Val He He Asp Asn Phe Asn Glu Gin Lys 
1525 1530 1535 

Lys Lys Ala Gly Gly Ser Leu Glu Met Phe Met Thr Glu Asp Gin Lys 
1540 1545 1550 

Lys Tyr Tyr Ser Ala Met Lys Lys Met Gly Ser Lys Lys Pro Leu Lys 
1555 1560 1565 

Ala He Pro Arg Pro Arg Trp Arg Pro Gin Ala He Val Phe Glu He 
1570 1575 1580 

Val Thr Asp Lys Lys Phe Asp He He He Met Leu Phe He Gly Leu 
1585 1590 1595 1600 

Asn Met Phe Thr Met Thr Leu Asp Arg Tyr Asp Ala Ser Asp Thr Tyr 
1605 1610 1615 

Asn Ala Val Leu Asp Tyr Leu Asn Ala He Phe Val Val He Phe Ser 
1620 1625 1630 

Ser Glu Cys Leu Leu Lys He Phe Ala Leu Arg Tyr His Tyr Phe He 
1635 1640 1645 

Glu Pro Trp Asn Leu Phe Asp Val Val Val Val He Leu Ser He Leu 
1650 1655 1660 

Gly Leu Val Leu Ser Asp He He Glu Lys Tyr Phe Val Ser Pro Thr 
1665 1670 1675 1680 

Leu Leu Arg Val Val Arg Val Ala Lys Val Gly Arg Val Leu Arg Leu 
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1685 



1690 



1695 



Val Lys Gly Ala Lys Gly He Arg Thr Leu Leu Phe Ala Leu Ala Met 
1700 1705 1710 

Ser Leu Pro Ala Leu Phe Asn He Cys Leu Leu Leu Phe Leu Val Met 
1715 1720 1725 

Phe He Phe Ala He Phe Gly Met Ser Phe Phe Met His Val Lys Glu 
1730 1735 1740 

Lys Ser Gly He Asn Asp Val Tyr Asn Phe Lys Thr Phe Gly Gin Ser 
1745 1750 1755 1760 

Met He Leu Leu Phe Gin Met Ser Thr Ser Ala Gly Trp Asp Gly Val 
1765 1770 1775 

Leu Asp Ala He He Asn Glu Glu Ala Cys Asp Pro Pro Asp Asn Asp 
1780 1785 1790 

Lys Gly Tyr Pro Gly Asn Cys Gly Ser Ala Thr Val Gly He Thr Phe 
1795 1800 1805 

Leu Leu Ser Tyr Leu Val He Ser Phe Leu He Val He Asn Met Tyr 
1810 1815 1820 

He Ala Val He Leu Glu Asn Tyr Ser Gin Ala Thr Glu Asp Val Gin 
1825 1830 1835 1840 

Glu Gly Leu Thr Asp Asp Asp Tyr Asp Met Tyr Tyr Glu He Trp Gin 
1845 1850 1855 

Gin Phe Asp Pro Glu Gly Thr Gin Tyr He Arg Tyr Asp Gin Leu Ser 
1860 1865 1870 

Glu Phe Leu Asp Val Leu Glu Pro Pro Leu Gin He His Lys Pro Asn 
1875 1880 1885 

Lys Tyr Lys He He Ser Met Asp He Pro He Cys Arg Gly Asp Leu 
1890 1895 1900 

Met Tyr Cys Val Asp He Leu Asp Ala Leu Thr Lys Asp Phe Phe Ala 
1905 1910 1915 1920 

Arg Lys Gly Asn Pro He Glu Glu Thr Gly Glu He Gly Glu He Ala 
1925 1930 1935 

Ala Arg Pro Asp Thr Glu Gly Tyr Glu Pro Val Ser Ser Thr Leu Trp 
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1940 



1945 



1950 



Arg Gin Arg Glu Glu Tyr Cys Ala Arg Leu He Gin His Ala Trp Arg 
1955 I960 1965 

Lys His Lys Ala Arg Gly Glu Gly Gly Gly Ser Phe Glu Pro Asp Thr 
1970 1975 1980 

Asp His Gly Asp Gly Gly Asp Pro Asp Ala Gly Asp Pro Ala Pro Asp 
1985 1390 1995 2000 

Glu Ala Thr Asp Gly Asp Ala Pro Ala Gly Gly Asp Gly Ser Val Asn 
2005 2010 2015 

Gly Thr Ala Glu Gly Ala Ala Asp Ala Asp Glu Ser Asn Val Asn Ser 
2020 2025 2030 

Pro Gly Glu Asp Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 
2035 2040 2045 

Ala Ala Gly Thr Thr Thr Ala Gly Ser Pro Gly Ala Gly Ser Ala Gly 
2050 2055 2060 

Arg Gin Thr Ala Val Leu Val Glu Ser Asp Gly Phe Val Thr Lys Asn 
2065 2070 2075 2080 

Gly His Lys Val Val He His Ser Arg Ser Pro Ser He Thr Ser Arg 
2085 2090 2095 

Thr Ala Asp Val 
2100 



35 



